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Patent 18810-81553 
GENETIC MARKER FOR SPONDYLOEPIMETAPHYSEAL DYSPLASIA 

The U.S. Government has a paid-up license in this invention and the right in 
limited circumstances to require the patent owner to license others on reasonable terms as 
provided for by the terms of grants HD22657 and AR02038 awarded by the NIH. 



BACKGROUND OF THE INVENTION 

Throughout the application various publications are referenced in parentheses. 
The disclosures of these publications in their entireties are hereby incorporated by 
reference in the application in order to more fully describe the state of the art to which 
this invention pertains. 

1 . THE FIELD OF THE INVENTION 

This invention relates to the medical arts. In particular, it relates to a genetic 
marker that is useful for diagnosing or treating spondyloepimetaphyseal dysplasia and for 
identifying genetic carriers of heritable alleles associated with spondyloepimetaphyseal 
dysplasia. 

2. DISCUSSION OF THE RELATED ART 

Osteochondrodysplasias are a genetically heterogeneous group of disorders related 
to cartilage producing cells. Abnormalities in cartilage formation can cause defects in 
bone deposition, skeletal development, linear growth, and the continued maintenance of 
cartilage and bone. (Reviewed in Mundlos, S. and Olsen, B. R., Heritable diseases of the 
skeleton. Part II: Molecular insights into skeletal development-matrix components and 
their homeostasis, FASEB J. ll(4):227-33 [1997]). 

There are numerous and disparate causes of osteochondrodysplasias. (Horton, 
W.A., Molecular genetic basis of the human chondrodysplasias^ Endocrinol. Metab. Clin. 
North Am. 25(3):683-97 [1996]), A significant number are due to defects in the collagen 
genes themselves. (Reviewed in Williams, C.J. and Jiminez, S.A., Heritable diseases of 
cartilage caused by mutations in collagen genes, J. Rheumatol. Suppl. 43:28-33 [1995]; 
Byers, P.H,, Molecular genetics of chondrodysplasias, including clues to development, 
structure, and function, Curr. Opin. Rheumatol. 6(3):345-50 [1994] ). In addition to 
collagen defects, several forms of osteochondrodysplasias are caused by mutations in the 
cartilage oligomeric matrix protein (COMP). (Ikegawa, S., et al. Novel and recurrent 



LAI 351S12v2 



Patent 18810-81553 
COMP mutations in preudoachondroplasia and epiphyseal dysplasia. Hum. Genet. 

103(6);633-8 [1998]; Briggs, M.D., et al. Diverse mutations in the gene for cartilage 

oligomeric matrix protein in the pseudoachondroplasia-multiple epiphyseal dysplasia 

disease spectrum. Am. J. Hum. Genet. 62(2):311-9 [1998]; Briggs, M.D., et al, 

5 Pseudoachondroplasia and multiple epiphyseal dysplasia due to mutations in the 

cartilage matrix protein gene, Nat. Genet. 10(3):330-6 [1995]; Ballo, K, et al. Multiple 

epiphyseal dysplasia, ribbing type: a novel point mutation in the COMP gene in a South 

African family. Am. J. Med. Genet. 68(4):396-400 [1997]). 

Still other osteochondrodysplasias have been found to be caused by defects in 

10 secreted peptide growth factors and their receptors. (E.g., Thomas, J.T., et al, A human 

chondroplysplasia due to a mutation in a TGF~f3 superfamily member, Nat. Genet. 

y 12(3):315-7 [1996]; Bonaventure, 1, et al. Common mutations in the gene encoding 

CO FGFR-3 account for achondroplasia, hypochondroplasia and thanatophoric dysplasia, 

55 Acta Paediatr. Suppl. 417:33-38 [1996]). 

15 Finally, mutations in genes which affect protein sulfation cause some forms of 

» H 

13 osteochondrodysplasia. Protein sulfation is a post-translational modification carried out 

by all cells. (Lipmann, F., Biological sulfate activation and transfer. Science 128:575- 

'jg 580 [1958]). The primary source of sulfur for the sulfation pathway is firee sulfate, which 

fll can be transported into the cytoplasm by one of a variety of transmembrane symporter or 

:J 20 antiporter molecules. (Elgavish, A., et al. Sulfate transport in human lung fibroblasts 

(IMR-90), I Cell Physiol. 125:243-250 [1985]; Markovich, D., etal. Expression cloning 

of rat renal Na^ /SO cotransport, Proc. Natl. Acad. Sci. U.S.A. 90:8073-8077 [1993]; 

Bissig, M., et ah. Functional expression cloning of the canalicular sulfate transport 

system of rat hepatocytes, J. BioLChem. 269:3017-3021 [1994]; Hastbacka, J., etal. The 

25 diastrophic dysplasia gene encodes a novel sulfate transporter: positional cloning by 

fine-structure linkage disequilibrium mapping. Cell 78:1073-1087 [1994]; Everett, L.A, 

et ah , Pendred syndrome is caused by mutations in a putative sulphate transporter gene 

(PDS), Nat Genet. 17:411-422 [1997]). 

Within the cytoplasm, sulfate is activated to a high energy form in two enzymatic 

30 steps (Geller, D.H., et ah, Co-purification and characterization of ATP-sulfurylase and 

adenosine-S'-phosphosulfate kinase from rat chondrosarcoma, J. Biol. Chem. 262:7374- 

7382 [1987]). First, utilizing ATP and sulfate as substrates, an ATP sulfurylase activity 
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catalyzes the synthesis of adenosine 5'-phosphosulfate (APS). Subsequently, an APS 

kinase activity catalyzes the phosphorylation of the APS to generate 3'-phosphoadenosine 

5'-phosphosulfate (PAPS). PAPS is the universal bioactivated sulfate donor used in all 

known post-translational sulfation reactions. 

Secreted extracellular matrix proteins are post-translationally sulfated in the 
Golgi, and delivery of PAPS to the Golgi is mediated by a PAPS translocase activity. 
Microsomal proteins with PAPS binding activity have been identified (Mandon, E.C, et 
al. Purification of the golgi adenosine 3* -phosphate 5^-phosphosulfate transporter, 
homodimer within the membrane, Proc.NatlAcad.Sci.U.S.A. 91:10707-10711 [1994]; 
Ozeran, J.D,, et al. Kinetics of PAPS translocase: evidence for an antiport mechanism. 
Biochemistry 35:3685-3694 [1996]), but the tissue specificity and the contribution of 
these proteins to PAPS transport remains unknown. Following transport, sulfation 
reactions are carried out by substrate-specific sulfotransferases. A major class of 
sulfation substrates within the Golgi is the side-chains of proteoglycans, which are 
abundant structural proteins of the extracellular matrices of many tissues. Proteoglycans 
are particularly abundant in the extracellular matrix of cartilage. 

Direct evidence that sulfation of extracellular matrix proteins is essential for 

proper matrix function was revealed by the identification of mutations in the diastrophic 

dysplasia sulfate transporter, DTDST. (Hastbacka et ah [1994]). Mutations in the DTDST 

gene produce a spectrum of recessively inherited osteochondrodysplasia phenotypes. 

(Hastbacka et al [1994]; Hastbacka, J., et at, Atelosteogenesis type II is caused by 

mutations in the diastrophic dyplasia sulfate transporter gene (DTDST): Evidence for a 

phenotypic series involving three chondrodysplasias^ Am. J. Hum. Genet. 58:255-262 

[1996]; Superti-Furga, et al, A family of chondrodysplasias caused by mutations in the 

diastrophic dysplasia transporter gene and associated with impaired sulfation of 

proteoglycans, Ann. N, Y. Acad. Sci. 785:195-201 [1996]). The severity of the three 

known disorders, i.e., the moderately severe diastrophic dysplasia phenotype and the 

lethal forms, atelosteogenesis type II and achondrogenesis type IB, is correlated with the 

consequences of the mutations on the activity of the transporter. (Superti-Furga, A, et al 

Achondrogenesis type IB is caused by mutations in the diastrophic dysplasia sulfate 

transporter gene. Nature Genet. 12:100-02 (1996). The mutations lead to dramatically 

reduced proteoglycan sulfation in cartilage, particularly the chondroitin sulfate side 
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chains of aggrecan. However, even in the most severe disorder in the group, some 

proteoglycan sulfation can be measured. (Rossi, A., et al^ Undersulfation of 

proteoglycans synthesized by chondrocytes from patient with achondrogenesis type IB 

homozygous for an L483P substitution in the diastrophic dysplasia sulfate transporter^ J. 

Biol. Chem. 271:18456-64 [1996]; Rossi, A., et al, Undersulfation of cartilage 

proteoglycans ex vivo and increased contribution of amino acid sulfur to sulfation in vitro 

in McAlister dysplasia/atelosteogenesis type 2, Eur. J. Biochem. 248:741-47 [1997]; 

Rossi, A., et aL^ Proteoglycan sulfation in cartilage and cell cultures from patients with 

sulfate transporter chondrodyplasias: relationship to clinical severity and indications on 

the role of cellular sulfate production^ Matrix Biology 17:361-69[1998]). 

Biochemical evidence that a defect in another step in the sulfation pathway can 
produce an osteochondrodysplasia phenotype was provided by studies in the 
brachymorphic {bm) mouse. The brachymorphic phenotype is characterized by 
disproportionate short-limb dwarfism, a short spine and tail, and a domed skull. (Lane & 
Dickie [1968]). The brachymorphic mouse also exhibits an increased bleeding time, but 
tests of platelet function, including aggregation and secretion, have so far failed to reveal 
specific functional deficits in brachymorphic platelets. (Rusiniak, M.E., et al^ Molecular 
markers near the mouse brachymorphic (bm) gene, which affects connective tissues and 
bleeding time, Mamm. Genome 7:98-102 [1996]) 

In brachymorphic mice, abnormal growth plates with a structurally abnormal 

cartilage extracellular matrix and short chondrocyte columns, with comparatively 

unaligned cells, are apparent on histologic analysis. (Lane & Dickie [1968]; Orkin et ah 

[1976]; Orkin et al\\911\ Miller, W.A. & Flynn-Miller, K.L., Achondroplastic, 

brachymorphic and stubby chondrodystophies in mice^ J. Comp. Pathol 86:349-63 

[1976]). Cartilage from this recessively inherited mutant phenotype shows small, diffuse 

proteoglycan granules and reduced staining for sulfated glycosaminoglycans, consistent 

with a defect affecting sulfation of the proteoglycans of the cartilage extracellular matrix. 

(Lane, P. & Dickie, M.M., Three recessive mutations producing diproportionate dwarfing 

in mice: achondroplasia, brachymorphic, and stubby, J. Hered. 59:300-08 [1968]). In 

brachymorphic mice, proteoglycan granules show a 50% reduction in size in the reserve 

zone of the grov^h plate matrix, and are difficult to identify in the proliferative and 

hypertrophic zones. (Orkin, R.W., et al, Undersulfated chondroitin sulfate in the 
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cartilage matrix of brachymorphic mice. Dev. Biol. 50:82-94 [1976]; Orkin, R.W., et al. 

Defects in the cartilaginous growth plates of brachymorphic mice, J. Cell Biol 73:287-99 

[1977]). This suggests that reduced proteoglycan sulfation affects the signals that 

regulate growth plate chondrocyte maturation. Heparan sulfate proteoglycans have been 

implicated in the sequestration and presentation of growth factors, particularly fibroblast 

growth factors, to receptors at the cell surface. (Rapraeger, A.C., et al. Requirement of 

heparan sulfate for bFGF-mediated fibroblast growth and myoblast differentiation^ 

Science 252:1705-08 [1991]; Yayon, A., et al.^ Cell surface, heparin-like molecules are 

required for binding of basic Fibroblast Growth Factor to its high-affinity receptor. Cell 

64:841-48 [1991]; Schlessinger, J., et al. Regulation of growth factor activation by 

proteoglycans: What is the role of the low affinity receptors?. Cell 83:357-60 [1995]). 

Skeletal defects in mice lacking heparan sulfate 2-sulfotransferase include dwarfism, with 

shortened long bones, ribs and spine, implying a specific role for heparan sulfate 

proteoglycans in skeletal development. (Bullock, S.L, et ah. Renal agenesis in mice 

homozygous for a gene trap mutation in the gene encoding heparan sulfate 2- 

5////b/ran^^ra5^, Genes Dev. 12:1894-1906 [1998]). 

Indeed, reduced activities of both ATP sulfurylase and APS kinase, and decreased 
synthesis of chondroitin sulfate, have been demonstrated in bm mice. (Schwartz, N.B., et 
ah. Defective PAPS-synthesis in epiphyseal cartilage from brachymorphic mice, 
Biochem. Biophys. Res, Commun. 82:173-78 [1978]; Sugahara, K. & Schwartz, N.B., 
Defect in 3'-phosphoadenosine 5^-phosphosulfate formation in brachymorphic mice, 
Proc. Natl. Acad, Sci, U.S.A. 76:6615-18 [1979]). The ATP sulfurylase and APS kinase 
activities in brachymorphic mouse tissues co-purified. (Geller, D.H., et al. [1987]; Lyle, 
S.^ et a/.. Rat chondrosarcoma ATP sulfurylase and adenosine 5^-phosphosulfate kinase 
reside on a single bifunctional protein. Biochemistry 33:5920-25 [1994]). The functional 
activity of the APS kinase was reduced to a greater extent than was that of the ATP 
sulfurylase, suggesting that brachymorphism resulted from a structural mutation that 
affected channeling APS from the carboxyl-terminal sulfurylase activity to the amino- 
terminal kinase. (Lyle, S., et al, Sulfate-activating enzymes in normal and 
brachymorphic mice: evidence for a channeling defect. Biochemistry 34:940-45 [1995]). 

A cDNA encoding a murine bifunctional ATP-sulfurylase/APS-kinase enzyme 

(known as Papssl, formerly known as Atpskl) was isolated from brachymorphic mice 
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and studied by mutational analysis. (Li, H. et al. The isolation and characterization of 

cDNA encoding the mouse hifunctional ATP sulfurylase-adenosine 5^-phosphosulfate 
kinase^ J. Biol Chem. 270(49):29453-59 [1995]; Deyrup, A.T. et al. Deletion and site- 
directed mutagenesis of the ATP -binding motif (P-Ioop) in the hifunctional murine ATP- 
sulfurylase/adenosine 5^-phosphosulfate kinase enzyme^ J. Biol. Chem. 273(16):9450-56 
[1998]). Subsequently, bifunctional enzymes having ATP-sulfurylase (E.G. 
2. 7. 7. 4)/ APS "kinase (E.G. 2.7.1.25) activities were designated "PAPS synthetase" 
(PAPSS; also known as ASAPK and ATPSK). 

The hm mutation was placed on the mouse phenotypic map by Lane and Dickie 
(1968). Subsequent application of microsatellite markers to mice from a large backcross 
localized the hm mutation to a 2.5 cM region located at approximately 32 cM on mouse 
chromosome 19. (O'Brien, E.P., et al^ Molecular map of chromosome 19 including three 
genes affecting bleeding time: ep, ru and bm, Mamm. Genome 5:356-60 [1994]; 
Rusiniak, M.E., et al [1996]). In addition, a liver cancer susceptibility gene, Hcs6, has 
been localized near bmin mouse strain C3H/He. (Manenti, G., etal [1994]). 

The bm mutation co-localized with a PAPSS gene. (Kurima, K. et al, A member 

of a family of sulfate-activating enzymes causes murine brachymorphism, Proc. Natl. 

Acad, Sci. 95(15):8681-85 [1998]; Sugahara, K. and Schwartz, N.B., Defect in 3'- 

phosphoadenosine 5'-phosphosulfate formation in brachymorphic mice, Proc. Nat. Acad. 

Sci. 76(1 2): 66 15- 18 [1979]). Although brachymorphic mouse liver shows decreased 

PAPSS activity, the brachymorphic phenotype does not manifest any grossly 

recognizable liver defects. It has been observed that brachymorphic mouse liver has a 

decreased ability to esterify xenobiotic carcinogens with sulfate (Lyman, S.D. & Poland, 

A., Effect of the brachymorphic trait in mice on xenobiotic sulfate ester formation^ 

Biochem. Pharmacol. 32:3345-50 [1983]), which implies that altered susceptibility to 

such agents could result from defects in a PAPSS gene. Gonsistent with this hypothesis, 

brachymorphic mice have decreased susceptibility to carcinogen-induced 

hepatocarcinoma, presumably due to a decreased ability to generate sulfated carcinogens. 

(Boberg, E.W., et al^ Strong evidence from studies with brachymorphic mice and 

pentachlorophenol that V-sulfooxysafrole is the major ultimate electrophilic and 

carcinogenic metabolite of V-hydroxysafrole in mouse liver ^ Cancer Res. 43:5163-73 

[1983]; Lai, C.C., et al. Initiation of hepatocarcinogenesis in infant male B6C3F1 mice 

6 

LAI 351812V2 



Patent 18810-81553 

by N~hydroxy-2-ammofluorene depends primarily on metabolism to N-sulfooxy-2- 

aminofluorene and formation of DNA-(deoxyguanosin-8~yl)~2--aminofluorene adducts. 

Carcinogenesis 8:471-78 [1987]). 

PAPSS enzymes have been isolated and characterized in non-murine systems, for 
example, in association with rat chondrosarcomas. (Rosenthal, E. and Leustek, T., A 
multifunctional Urechis caupo protein, PAPS synthetase, has both ATP sulfurylase and 
APS kinase activities. Gene 165(2):243-48 [1995]; Schwartz, N.B., et al. Sulfate 
activation and transport in mammals; system components and mechanisms^ Chem. Biol 
Interact 109(1-3): 143-51 [1998]; Lyle, S. et al. Rat chondrosarcoma ATP sulfurylase 
and adenosine 5^-phosphosulfate kinase reside on a single bifunctional protein^ 
Biochemistry 33(19):5920-25 [1994]; Lyle, S. et al^ Intermediate channeling between 
ATP sulfurylase and adenosine 5'-phosphosulfate kinase from rat chondrosarcoma^ 
Biochemistry 33(22):6822-27 [1994]). Human cDNAs for PAPSS have also been 
cloned, and their activities have been analyzed biochemically and mutationally. 
(Yanagisawa, K. et ah, cDNA cloning, expression, and characterization of the human 
bifunctional ATP sulfurylase/adenosine 5*-phosphosulfate kinase enzyme^ Biosci. 
BiotechnoL Biochem. 62(5):1037-40 [1998]; Girard, J.P., et al. Sulfation in high 
endothelial venules: cloning and expression of the human PAPS synthetase, FASEB J. 
12(7): 603 -12 [1998]; Venkatachalam, K.V. et ah. Molecular cloning, expression, and 
characterization of human bifunctional 3^-phosphoadenosine 5*-phosphosulfate synthase 
and its functional domains, J. Biol Chem. 273(30): 193 11-20 [1998]; Ventkatachalam, 
K.V., et al. Site-selected mutagenesis of a conserved nucleotide binding HXGH motif 
located in the ATP sulfurylase domain of human bifunctional S'-phosphoadenosine 5- 
phosphosulfate synthase, J. Biol Chem. 274(5):2601-04 [1999]). 

Phenotypically analogous to murine brachymorphism, is spondyloepimetaphyseal 

dysplasia (SEMD) in humans. SEMD is a subgroup of osteochondrodysplasias affecting 

skeletal development, linear bone and cartilage grov^h, and bone and cartilage 

maintenance. Effects of SEMD can include dwarfism, stunted or malformed limbs, 

enlarged joints, kyphoscoliosis (spinal warping), and brachydactyly (short fingers and 

toes). SEMD typically runs in families, and can be inherited in autosomal dominant or 

recessive manners. (Ahmad, M. et al. Distinct, autosomal recessive form of 

spondyloepimetaphyseal dysplasia segregating in an inbred Pakistani kindred. Am. J. 
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Med. Genet. 78(5):468-73 [1998]; Figuera, L.E., et al, Spondyloepimetaphyseal 

dysplasia (SEMD) Shohat type. Am. J. Med. Genet. 51(3):213-15 [1994]; Shohat, M. et 

ah. New form of spondyloepimetaphyseal dysplasia (SEMD) in Jewish family of Iraqi 

origin. Am. J. Med. Genet. 46(4):358-62, [1993]; Whyte, M.P. et al. Hypotrichosis with 

spondyloepimetaphyseal dysplasia in three generations; a new autosomal dominant 

syndrome. Am. J. Med. Genet. 36(3):288-91 [1990]; Gertner, J.M., et al. Linkage studies 

of a Missouri kindred with autosomal dominant spondyloepimetaphyseal dysplasia 

(SEMD) indicate genetic heterogeneity, J. Bone Miner. Res. 12(8): 1204-9, [1997]). 

A nonsense mutation in a novel, cartilage-specific human PAPSS (PAPSS2, 

formerly ATPSK2), has been indicated as the cause of a recessive form of SEMD in an 

inbred Pakistani family. (Ahmad, M., et ah, A distinct, autosomal recessive form of 

spondyloepimetaphyseal dysplasia segregating in an inbred Pakistani kindred. Am. J. 

Med. Genet. 78:468-73 [1998]; ul Haque, M.F., et al. Mutations in orthologous genes in 

human spondyloepimetaphyseal dysplasia and the brachymorphic mouse, Nat. Genet. 

20(2): 157-62 [1998]), Genome-wide linkage studies localized the disease gene for this 

dwarfing condition to chromosome 10q23-24, in a region syntenic with the locus for the 

bm locus on mouse chromosome 19. (ul Haque, M.F., et al [1998]). This disorder, 

designated SEMD Pakistani type, is characterized by short, bowed lower limbs, enlarged 

knee joints, kyphoscoliosis, a mild generalized brachydactyly, and early-onset 

degenerative joint disease in the hands and knees. Radiographs of patients with SEMD 

Pakistani type show delayed epiphyseal ossification, especially at the hips and knees, and 

platyspondyly. 

Currently, there are only a few methods of detecting bone related diseases. (Eg., 

Klock, J.C., Chondroitin sulfate as a marker for bone resorption, U.S. Patent No. 

5,869,273; Takeshita, S. et al. Bone-related sulfatase-like protein and process for its 

production, U.S. Patent No. 5,627,050). DNA-based diagnostic approaches have been 

suggested for some type 2 collagen disorders, such as Stickler syndrome, 

spondyloepiphyseal dysplasia, and achondrogenesis; achondroplasia (a defect in the 

fibroblast growth factor receptor 3 (FGFR3) gene); the collagen oligomeric matrix protein 

(COMP) disorders pseudoachondroplasia and multiple epiphyseal dysplasia, and others. 

(E.g., Ritvaniemi, Arthritis and Rheumatism 38:999-1004 [1995]; Shiang et al. 

Mutations in the transmembrane domain ofFGFRS cause the most common genetic form 

8 

LAI 351812v2 



Patent 18810-81553 
of dwarfism, achondroplasia^ Cell 78:335-42 [1994]; Briggs et al, 

Pseudoachondroplasia and multiple epiphyseal dysplasia due to mutations in the 

cartilage of oligomeric matrix protein gene ^ Nature Genetics 10:330-36 [1995]; Superti- 

Ferga et al , Recessively inherited multiple epiphyseal dysplasia with normal stature, club 

foot, and double layered patella caused by DTDST mutation, J. Med. Genet. 36:621-24 

[1999]). But most reported diagnostic methods relate to osteogenic proteins. (E.g., 

Khandke, K. M., et al^ Chromatographic process for the copurification of chondroitinase 

I and TI proteins from Proteus vulgaris, U.S. Patent No. 5,525,500; Parsons, T. F., et al. 

Osteogenic Factors, U.S. Patent No. 5,106,626; Oppermann, H., et al. Cartilage and 

hone-inducing proteins, U.S. Patent No. 5,750,651). 

Accordingly, a reliable method is still needed for diagnosing SEMD, detecting the 

presence of SEMD in recessive carriers, and for treating osteoarthritic disorders, 

including osteochondrodysplasias, that are caused or aggravated by deficient PAPS 

synthetase activity. The present invention provides these and other advantages, as 

described herein. 

SUMMARY OF THE INVENTION 
The present invention relates to an isolated polynucleotide or to a nucleic acid 
construct that comprises a nucleic acid segment encoding a 3'-phosphoadenosine-5'- 
phospho sulfate (PAPS) synthetase (PAPSS), particularly, a human PAPSS2 nucleotide 
sequence of (SEQ. ID. NO.:l) or an orthologous murine Papssl nucleotide sequence 
(SEQ. ID. NO.:2), sequences complementary to either one of them, degenerate coding 
sequences, or gene-specific fragments of (SEQ. ID. NOS.:l and 2). The present 
polynucleotides and nucleic acid constructs containing PAPSS2 and Papssl nucleotide 
sequences include RNA; DNA; and chimeric RNA/DNA. Embodiments include probes, 
primers, and expression vectors containing PAPSS2- and Pa/?552-specific nucleotide 
sequences. 

The present invention also relates to a genetically modified vertebrate cell 

containing a nucleic acid construct of the present invention and to a non-human vertebrate 

containing the cell. The present invention also relates to human PAPSS2 and murine 

Papss2 proteins encoded by the present polynucleotide or nucleic acid construct, 

including fusion proteins that contain, together with a PAPSS2 or Papss2 amino acid 

sequence, any other predetermined polypeptide sequence. The present invention also 
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relates to antibodies and antibody fragments that selectively bind the PAPSS2 or Papss2 

protein. 

The present invention is also directed to a method of diagnosing 
spondyloepimetaphyseal dysplasia (SEMD) in a human subject. The method involves 
5 amplifying nucleic acids from a sample that define an PAPSS2 gene sequence, or a gene- 
specific fragment thereof; and analyzing the amplified nucleic acids for the presence of 
homozygosity for a variant allele of PAPSS2. The sample is a bodily substance 
containing human nucleic acid, for example a blood sample, obtained fi'om a human 
subject having at least one symptom of SEMD. Homozygosity for a variant allele of 
10 PAPSS2 is diagnostic for SEMD in the human subject. In particular, the present 
invention also relates to a method of diagnosing SEMD Pakistani-type in a human 
subject. But the present invention also relates to a method of identifying a heterozygous 
human carrier of an SEMD-associated allele. 

The present invention provides a genetic testing kit for diagnosing SEMD or for 
15 identifying a human carrier of SEMD, including SEMD Pakistani-type. The genetic 
testing kit contains oligonucleotide primers of the present invention. 

Also, nucleic acid constructs of the present invention are used in a method of gene 
therapy for treating a human subject having an osteoarthritic disorder that is caused or 
aggravated by deficient enzymatic sulfation activity. The present invention is also 
20 related to a protein therapy method for treating a human subject having an osteoarthritic 
disorder that is caused or aggravated by deficient enzymatic sulfation activity. This 
method employs the inventive PAPSS2 fusion protein. 



BRIEF DESCRIPTION OF THE DRAWINGS 



25 





Figure 1 shows a genetic map of the interval containing the SEMD disease gene 
on human chromosome 10. Distances between loci are given in centimorgans (cM). 

Figure 2;sl^ws a genetic map of the bm region of mouse chromosome 19. The 
0.7 cM ipt^m^al containing the disease gene and locus D19Mitl3 is shown with a darker 
linj 



Figure 3 shows a variant allele o/pAPSS2 associated with SEMD Pakistani type. 
Sequences derived from amplified/DNA fragment from an affected family member 
(SEMD) and from a control (1^3^ are shown. An arrow at nucleotide +1424 marks the 
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location of a mutation, and the sequence and the implied effect of the mutation on 

the PAPSS2 protein sequenc^fs summarized below the nucleotide sequence. 

Figure 4 compares expression of PAPSSl and PAPSS2 mRNAs with an actin 

mRNA control in samples from various human tissues, as visualized by Northern 

hybridization with specific cDNA probes for each. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

tKO The present invention relates to an isolated polynucleotide, of natural origin or a 

nucleic acid construct, that comprises a nucleic acid segment encoding an ATP 
sulfurylase/APS kinase, i.e., a PAPS synthetase (PAPSS), particularly, a humzxi PAPSS2 
nucleotide sequence of (SEQ. ID. NO.:l) or an orthologous murine Papss2 nucleotide 
sequence (SEQ. ID. NO.:2). 

The human PAPSS2 gene is located on chromosome 10 at locus 10q23-24. (ul 
Haque, M.F., et al [1998]). More specifically, the PAPSS2 gene region is flanked by 
microsatellite markers D10S1143 and D10S2470. (See Figure 1; C. Dib et al, A 
comprehensive genetic map of the human genome based on 5,264 microsatellites^ Nature 
380(6570): 152-54 [1996]), The cloned human PAPSS2 nucleotide sequence is the 
following (GenBank accession AF091242; transcriptional start site [underlined] 
commences at position +1): 

-80 CTGCTGCCGC CGCCGCCGCC GCCGTCCCTG CGTCCTTCGG TCTCTGCTCC CGGGACCCGG 

-20 CTCCGCCGCA GCCAGCCAGC ATGTCGGGGA TCAAGAAGCA AAAGACGGAG AACCAGCAGA 

+41 AATCCACCAA TGTAGTCTAT CAGGCCCACC ATGTGAGCAG GAATAAGAGA GGGCAAGTGG 

+ 101 TTGGAACAAG GGGTGGGTTC CGAGGATGTA CCGTGTGGCT AACAGGTCTC TCTGGTGCTG 

+ 161 GAAAAACAAC GATAAGTTTT GCCCTGGAGG AGTACCTTGT CTCCCATGCC ATCCCTTGTT 

+22 1 ACTCCCTGGA TGGGGACAAT GTCCGTCATG GCCTTAACAG AAATCTCGGA TTCTCTCCTG 

+28 1 GGGACAGAGA GGAAAATATC CGCCGGATTG CTGAGGTGGC TAAGCTGTTT GCTGATGCTG 

+34 1 GTCTGGTCTG CATTACCAGC TTTATTTCTC CATTCGCAAA GGATCGTGAG AATGCCCGCA 

+401 AAATACATGA ATCAGCAGGG CTGCCATTCT TTGAAATATT TGTAGATGCA CCTCTAAATA 

+461 TTTGTGAAAG CAGAGACGTA AAAGGCCTCT ATAAAAGGGC CAGAGCTGGG GAGATTAAAG 

+52 1 GATTTACAGG TATTGATTCT GATTATGAGA AACCTGAAAC TCCTGAGCGT GTGCTTAAAA 

+581 CC AATTTGTC C AC AGTGAGT GACTGTGTCC ACC AGGTAGT GGAACTTCTG C AAGAGCAG A 

+64 1 ACATTGTACC CTATACTATA ATCAAAGATA TCCACGAACT CTTTGTGCCG GAAAACAAAC 

+70 1 TTGACCACGT CCGAGCTGAG GCTGAAACTC TCCCTTCATT ATCAATTACT AAGCTGGATC 
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+76 1 TCCAGTGGGT CCAGGTTTTG AGCGAAGGCT GGGCCACTCC CCTCAAAGGT TTCATGCGGG 
+82 1 AGAAGGAGTA CTTACAGGTT ATGCACTTTG ACACCCTGCT AGATGATGGC GTGATCAACA 
+88 1 TGAGCATCCC CATTGTACTG CCCGTCTCTG CAGAGGATAA GACACGGCTG GAAGGGTGCA 
+941 GCAAGTTTGT CCTGGCACAT GGTGGACGGA GGGTAGCTAT CTTACGAGAC GCTGAATTCT 
+ 1001 ATGAACACAG AAAAGAGGAA CGCTGTTCCC GTGTTTGGGG GAC AAC ATGT AC AAAAC ACC 
+ 1061 CCCATATCAA AATGGTGATG GAAAGTGGGG ACTGGCTGGT TGGTGGAGAC CTTCAGGTGC 
+ 1121 TGGAGAAAAT AAGATGGAAT GATGGGCTGG ACCAATACCG TCTGAC ACCT CTGGAGCTC A 
+ 1181 AACAGAAATG TAAAGAAATG AATGGTGATG CGGTGTTTGC ATTCC AGTTG CGCAATCCTG 
+ 1 24 1 TCCAC AATGG CC ATGCCCTG TTGATGC AGG AC ACCTGCCG CAGGCTCCTA GAGAGGGGCT 
+ 1301 AC AAGC ACCC GGTCCTCCT A CTACACCCTC TGGGCGGCTG GACC AAGGAT GACGATGTGC 
+1361 CTCTAGACTG GCGGATGAAG CAGC ACGCGG CTGTGCTCGA GGAAGGGGTC CTGGATCCC A 
+ 1 42 1 AGTC AACC AT TGTTGCC ATC TTTCCGTCTC CC ATGTTATA TGCTGGCCCC AC AGAGGTCC 
+ 1481 AGTGGC ACTG C AGGTCCCGG ATGATTGCGG GTGCCAATTT CTAC ATTGTG GGGAGGGACC 
+ 1541 CTGCAGGAAT GCCCCATCCT GAAACCAAGA AGGATCTGTA TGAACCCACT CATGGGGGC A 
+ 1601 AGGTCTTGAG CATGGCCCCT GGCCTCACCT CTGTGGAAAT CATTCCATTC CGAGTGGCTG 
+ 1661 CCTACAACAA AGCCAAAAAA GCCATGGACT TCTATGATCC AGCAAGGCAC AATGAGTTTG 
+ 1721 ACTTCATCTC AGGAACTCGA ATGAGGAAGC TCGCCCGGGA AGGAGAGAAT CCCCCAGATG 
+ 1781 GCTTCATGGC CCCC AAAGC A TGGAAGGTCC TGAC AGATTA TTAC AGGTCC CTGGAGAAGA 
+ 1841 ACTAAGCCTT TGGGTCCAGA GTTTCTTTCT GAAGTGCTCT TTGATTACCT TTTCTATTTT 
+1901 TATGATTAGA TGCTTTGTAT TAAATTGCTT CTCA// (SEQ . ID . NO . : 1 ) . 

Preferably, the nucleic acid segment encoding a human PAPS synthetase, specifically 
PAPSS2, comprises a nucleotide sequence defining an open reading frame within SEQ. 
ID. NO. : 1 that extends from nucleotide position +1 through +1 845 (SEQ. ID. NO. :9). 

The orthologous murine Papssl gene is located in the hm region of mouse 
chromosome 19, within a 0.7 cM interval near marker D19Mitl3, (See^ Figure 2; ul 
Haque, M.F., et al [1998]). The cloned murine Papssl nucleotide sequence is the 
following (GenBank accession AF085144; transcriptional start site [underlined] 
commences at position +1): 

-60 GTATTCTCAA CATCAGATAT CATGTCTTGG AGGAAGTTAC CTAAACTCTG AAGAATTATC 

+ 1 ATGTCTGCAA ATTTCAAAAT GAACC ATAAA AGAGACCAGC AAAAATCCAC CAATGTGGTC 

+6 1 TACCAGGCCC ATCATGTGAG CAGGAACAAG AGAGGACAAG TGGTTGGAAC CAGGGGAGGA 

+ 121 TTCCGAGGAT GTACCGTGTG GCTAACAGGT CTCTCTGGTG CTGGGAAAAC AACCATAAGC 

+ 181 TTTGCTTTGG AAGAGTACCT TGTATCTCAC GCCATCCCAT GTTACTCCCT GGATGGGGAC 
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+24 1 AATGTCCGTC ATGGCCTTAA TAAGAACCTG GGATTCTCTG CCGGGGACCG AGAAGAGAAT 
+301 ATCCGCCGGA TCGCGGAGGT GGCCAAGCTC TTTGCCGACG CCGGCCTGGT TTGCATCACC 
+3 6 1 AGCTTTATCT CTCCTTTTGC AAAGGATCGT GAGAATGCCC GAAAAATCC A CGAATC AGC A 
+42 1 GGACTCCCGT TCTTTGAGAT CTTTGTAGAT GCGCCTTTAA ATATCTGTGA AAGCCGAGAC 
+48 1 GTAAAAGGAC TCTACAAACG AGCCCGAGCA GGAGAGATTA AAGGGTTTAC AGGCATCGAT 
+541 TCTGACTATG AGAAACCTGA AACTCCAGAG TGTGTGCTGA AGACCAACTT GTCTTCAGTA 
+60 1 AGCGACTGTG TGCAACAGGT GGTGGAACTT TTGCAGGAGC AGAACATTGT ACCCCACACC 
+66 1 ACCATCAAAG GCATCCACGA ACTCTTTGTG CCAGAAAACA AAGTCGATCA AATCCGAGCT 
+72 1 GAGGCAGAGA CTCTCCCATC ACTACCAATT ACCAAGCTGG ATCTGCAGTG GGTGCAGATT 
+781 CTGAGTGAAG GCTGGGCCAC TCCCCTCAAA GGCTTTATGC GGGAGAAGGA ATACTTGCAA 
+84 1 ACTCTACACT TCGACACTCT ACTGGACGAT GGAGTCATCA ACATGAGTAT TCCCATTGTA 
+90 1 TTGCCCGTTT CTGCGGATGA CAAGGCACGG CTCGAAGGGT GCAGCAAATT TGCCTTGATG 
+96 1 TACGAAGGTC GGAGGGTCGC TCTATTACAG GACCCTGAAT TCTATGAGCA TAGGAAAGAG 
+ 1 02 1 GAGCGTTGTT CTCGTGTGTG GGGAACAGCC ACTGCAAAGC ACCCCCATAT CAAAATGGTG 
+ 1081 ATGGAAAGTG GGGACTGGCT TGTTGGTGGA GACCTACAGG TGCTAGAGAG AATAAGGTGG 
+ 1 141 GACGATGGGC TGGACCAATA CCGCCTTACG CCTCTGGAAC TCAAACAGAA GTGTAAAGAC 
+ 1201 ATGAATGCTG ATGCCGTGTT TGCATTCC AG TTGCGCAATC CTGTCCAC AA TGGTCATGCC 
+ 126 1 CTCCTGATGC AGGACACCCG CCGCAGGCTC CTGGAGAGGG GTTACAAGCA CCCAGTCCTC 
+ 1321 CTGCTCC ACC CTCTTGGGGG CTGGACC AAG GACGATGACG TACCTCTGG A ATGGAGGATG 
+ 1381 AAACAGC ATG C AGCTGTACT GGAGGAAAGG GTCCTGGATC CCAAGTC AAC TATTGTTGCC 
+ 1 44 1 ATCTTTCC AT CTCCTATGTT ATACGCTGGT CCCAC AGAGG TCCAGTGGC A TTGCAGATGC 
+ 1501 CGGATGATTG C AGGAGCC AA TTTCTAC ATT GTGGGT AGGG ATCCCGC AGG AATGCCCCAT 
+ 1561 CCTGAGACAA AGAAAGACCT ATATGAACCC ACCCACGGGG GCAAGGTCTT GAGTATGGCC 
+ 1621 CCTGGCCTTA CCTCTGTGGA AATAATTCCG TTCCGAGTGG CTGCCTACAA TAAAATTAAA 
+ 1681 AAGGCC ATGG ACTTTTATGA TCCAGC AAGG CACGAGGAGT TTGACTTCAT CTC AGGAACT 
+ 1741 CGCATGAGGA AGCTCGCCCG GGAAGGAGAA GATCCCCCAG ATGGCTTCAT GGCCCCGAAA 
+ 1801 GCGTGGAAAG TGTTGACAGA TTACTACAGG TCTCTGGAGA AGACCAACTA GGTGCTCCTG 
+ 1861 GCTCTGGCTT CTTCCTCAAG TGCTCTCTGA CGATTTTTTT TTTCTATTTT TGTGATTTAG 
+ 1921 CTGCTCTGTA TCCAATTGCA //(SEQ. ID. NO.: 2). 

Preferably, the nucleic acid segment encoding a murine PAPS synthetase, specifically 
Papss2, comprises a nucleotide sequence defining an open reading frame within SEQ. ID. 
NO. :2 that extends from nucleotide position +1 through +1851 (SEQ. ID. NO. : 10). 

In other embodiments, the nucleic acid segment has a nucleotide sequence 
complementary to SEQ. ID. NOS.;l, 2, 9, or 10, or comprises a degenerate coding 

13 

LAI 351812v2 



Patent 18810-81553 
sequence of any of these. Alternatively, the present nucleic acid segment is a gene- 
specific fragment of SEQ. ID. NOS.:!, 2, 9 or 10, or a gene-specific fragment of a 
complementary sequence or degenerate coding sequence. Gene-specific fragments are 
nucleic acid segments having a contiguous nucleotide sequence that is specific to PAPSS2 
ox Papss2, respectively. For example, a gene-specific firagment of SEQ. ID. NO.:l is 
SEQ. ID. NO.:9, and a gene-specific fragment of SEQ. ID. NO.:2 is SEQ. ID. NO.: 10. 
The skilled artisan can readily determine other gene-specific fragments by conducting a 
sequence similarity search of a genomics data base, such as the GenBank database of the 
National Center for Biotechnology Information (NCBI), using a computerized algorithm, 
such as PowerBLAST, QBLAST, PSI-BLAST, PHI-BLAST, gapped or ungapped 
BLAST, or the "Align" program through the Baylor College of Medicine server. (E.g., 
Altchul, S.F., et ah. Gapped BLAST and PSI-BLAST: a new generation of protein 
database search programs^ Nucleic Acids Res. 25(17):3389-402 [1997]; Zhang, L, & 
Madden, TX., PowerBLAST: a new network BLAST application for interactive or 
automated sequence analysis and annotation^ Genome Res. 7(6): 649-56 [1997]; Madden, 
T.L., et al. Applications of network BLAST server^ Methods Enzymol. 266:131-41 
[1996]; Altschul, S,F., et al^ Basic local alignment search tool, J. Mol. Biol. 215(3):403- 
10 [1990]). Preferably, a gene specific fragment is at least 5 to 30 contiguous 
nucleotides long, more preferably at least 6 to 15 contiguous nucleotides long, and most 
preferably at least 7 to 10 contiguous nucleotides long. Embodiments of the isolated 
polynucleotide or nucleic acid construct of the present invention include nucleic acid 
segments comprising RNA, DNA, or chimeric RNA/DNA. 

RNA segments include messenger RNA encoding an PAPSS2 or Papss2 protein, 
or a degenerate coding sequence, or a RNA sequence complementary to any of these. A 
complementary RNA segment can be an antisense RNA or a catalytic RNA sequence, 
such as a "hairpin" or "hammerhead" ribozyme, with a recognition sequence 
complementary to a gene-specific fragment of SEQ. ID, N0S.:1, 2, 9 or 10, or a 
degenerate coding sequence of any of these. 

Other embodiments of the isolated polynucleotide or nucleic acid construct 

include nucleic acid segments of DNA. In some embodiments, the DNA segment 

encoding a PAPSS2 or Papss2 protein is operatively linked to a promoter sequence. The 

promoter sequence can be an autologous PAPSS2 or Papss2 promoter sequence. 
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Alternatively, in a nucleic acid construct of the present invention, any suitable promoter 

sequence can be used, including an autologous promoter, a promoter from a heterologous 

human or non-human PAPS synthetase gene (e.g., PAPSSl or Papssl promoters), or a 

predetermined promoter sequence from any other human or non-human gene. Useful 

5 promoter sequences include constitutive promoters, such as, but not limited to, 

cytomegalovirus (CMV) promoter, or inducible promoters, such as, but not limited to, the 

human C-reactive protein (CRP) promoter (e.g., Kanzler, S., et al^ TGF-hetal in liver 

fibrosis: an inducible transgenic mouse model to study liver fibrogenesis^ Am. J. Physiol. 

276(4Pt l):G1059-68 [1999]), or the insulin-like growth factor (IGF-I) promoter (e.g., 

10 Meton I., et al. Growth hormone induces insulin-like growth factor-I gene transcription 
by synergistic action of STATS and HNF-1 alpha, FEES Lett. 444(2-3): 155-59 [1999]). 
Useful promoters include those that promote transcription in cells of diverse tissues, such 
as, but not limited to, an insulin receptor (IR) gene promoter (e.g., Tewari, D.S., et al. 
Characterization of the promoter region and 3' end of the human insulin receptor gene^ J. 

15 Biol Chem. 264(27): 16238-45 [1989]); growth hormone receptor (GHR) P2 or P3 
promoters (e.g., Jiang, H., et al^ Isolation and characterization of a novel promoter for 
the bovine growth hormone receptor gene, J. Biol Chem. 274(12):7893-900 [1999]); or a 
leptin promoter (e.g., Chen, X.L., et al. Analysis of a 762-bp proximal leptin promoter to 
drive and control regulation oftransgene expression of growth hormone receptor in mice, 

20 Biochem. Biophys. Res. Commun. 262(1): 187-92 [1999]). 

Also useful for various applications are tissue-selective promoters, i.e., promoters 
from which expression occurs preferentially in cells of a particular kind of tissue, 
compared to one or more other types of tissue. These include, cartilage-selective 
promoters for expression in chondrocytes, for example, an osteocalcin (OC) promoter 

25 (e.g., Newberry, E.P., et al. The REM domain of MINT, a novel Msx2 binding protein, 
recognizes and regulates the rat osteocalcin promoter. Biochemistry 38(33): 10678-90 
[1999]); a SOX9 promoter, aggrecan gene promoter (AGCl), or collagen oHgomeric 
matrix protein (COMP) gene promoter (e.g., Kanai, Y. & Koopman, P., Structural and 
functional characterization of the mouse Sox9 promoter: implications for campomelic 

30 dysplasia. Hum. Mol. Genet. 8(4):691-96 [1999]; Newton et al. Characterization of 

human and mouse cartilage oligomeric matrix protein. Genomics 24:435-39 [1994); or a 

promoter from a collagen gene, such as, but not limited to promoters for C0L2A1, 
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COL9A1, or COLIOAI. (e.g., Ganguly, A., et ah. Targeted insertions of two exogenous 

collagen genes into both alleles of their endogenous loci in cultured human cells: the 

insertions are directed by relatively short fragments containing the promoters and the 5' 

ends of the genes, Proc Natl Acad Sci USA 91(15):7365-9 [1994]; Dharmavaram, R. M., 

et al. Detection and characterization of Spl binding activity in human chondrocytes and 

its alterations during chondrocyte dedifferentiation, J. Biol Chem 272(43) :269 18-25 

[1997]; Zhou, G., et al.^ Three high mobility group-like sequences within a 48-base pair 

enhancer of the Col2al gene are required for cartilage-specific expression in vivo, J. Biol 

Chem 273(24): 14989-97 [1998]; Seghatoleslami, M. R., etaL, Differential regulation of 

COL2A1 expression in developing and mature chondrocytes^ Matrix Biol 14(9):753-64 

[1995]; Lefebvre, V., et al., An IS-base-pair sequence in the mouse proalphal(II) 

collagen gene is sufficient for expression in cartilage and binds nuclear proteins that are 

selectively expressed in chondrocytes, Mol Cell Biol 16(8);4512-23 [1996]; Zhou, G., et 

ah, A 182 bp fragment of the mouse pro alpha 1(11) collagen gene is sufficient to direct 

chondrocyte expression in transgenic mice, J. Cell Sci, 108(Pt 12):3677-84 [3677-84]; 

Mukhopadhyay, K., et al^ Use of a new rat chondrosarcoma cell line to delineate a 119- 

base pair chondrocyte-specific enhancer element and to define active promoter segments 

in the mouse pro-alpha 1(11) collagen gene, J. Biol Chem 270(46):2771 1-9 [1995]; 

Vikkula, M., et aL, Structural analysis of the regulatory elements of the tyupe-II 

procollagen gene. Conservation of promoter and first intron sequences between human 

and mouse, Biochem J 285(Pt l):287-94 [1992]; Beier, F., et al. Localization of silencer 

and enhancer elements in the human type X collagen gene, J Cell Biochem 662(2):210-8 

[1997]; Thomas, J. T., Sequence comparison of three mammalian type-X collagen 

promoters and preliminary functional analysis of the human promoter. Gene 1 60(2): 291- 

6 [1995]; Apte, S. S., Characterization of the mouse type X collagen gene. Matrix 

13(2): 165-79 [1993]). A cartilage-derived retinoic acid-sensitive protein (CD-RAP) 

gene promoter is also useful for cartilage-selective expression by chondrocytes, (e.g., 

Xie, W.F., et al, Transactivation of the mouse cartilage derived retinoic acid-sensitive 

protein gene by Sox9, J. Bone Miner. Res. 14(5):757-63 [1999]). 

For liver-selective expression in hepatocytes, useful promoter sequences include, 

an albumin gene promoter (e.g., Pastore, L., et al. Use of a liver-specific promoter 

reduces immune response to the transgene in adenoviral vectors. Hum. Gen. Ther. 

16 

LAI 351S12v2 



Patent 18810-81553 
10(11):1773-81 [1999]); a CYP7A or CYP7A1 promoter (e.g., Nitta, M., etal, CPF: an 

orphan nuclear receptor that regulates liver-specific expression of the human cholesterol 

Jalpha-hydroxylase gene, Proc. Natl. Acad. Sci. USA 96(12):6660-65 [1999]; Chen, J,, et 

ah^ Hepatocyte nuclear factor 1 binds to and transactivates the human but not the rat 

CYP7A1 promoter, Biochem. Biophys. Res. Commun. 260(3):829-34 [1999] ); a GHRPl 

promoter (e.g., Zou, L., et al^ Isolation of a liver-specific promoter for human growth 

hormone receptor gene. Endocrinology 138(4):1771-74 [1997]; Jiang, H., et al [1999]; 

Adams, T.E., Differential expression of growth hormone receptor messenger RNA from a 

second promoter, Mol Cell Endocrinol. 108(l-2):23-33 [1995]); or a thrombin- 

activatable fibrinolysis inhibitor (TAFI) promoter (e.g., Boffa, M.B., et al. 

Characterization of the gene encoding human TAFI [thromhin-activatahle fibrinolysis 

inhibitor; plasma procarboxypeptidase B] , Biochemistry 38(20):6547-58 [1999]). 

For hematopoietic tissue-selective expression in hematopoietic precursor cells, 
useful promoters include a cyclin AI promoter (e.g., MuUer, C, et al. Cloning of the 
cyclin Al genomic structure and characterization of the promoter region, J. Biol, Chem. 
276(16): 11220-28 [1999]); a CD34 promoter (e.g., Burn, T.C., et al. Hematopoietic stem 
cell specific gene expression, U.S. Patent No. 5,556,954); or an integrin alphallb 
promoter (e.g., Wilcox, D.A., et al.^ Integrin alphallb promoter-targeted expression of 
gene products in megakaryocytes derived from retrovirus-transduced human 
hematopoietic cells, Proc. Natl. Acad. Sci. USA 96(17):9654-59 [1999]). 

The foregoing examples of useful promoter sequences are not an exhaustive list, 
but are merely illustrative of the promoters available to the skilled artisan in practicing the 
present invention. 

For purposes of the present invention, "operatively linked" means that the 
promoter sequence, is located directly upstream from the PAPSS2 or Papss2 coding 
sequence, which is in reading frame, and that both sequences are oriented in a 5* to 3' 
manner, forming a transcriptional unit, such that transcription can take place in vitro in 
the presence of all essential enzymes, transcription factors, co-factors, activators, and 
reactants, under favorable physical conditions, e.g., suitable pH and temperature. This 
does not mean that, in any particular cell, conditions will favor transcription. 

By way of example, transcription of the DNA sequence encoding a PAPSS2 

protein produces mRNA transcript, which is translatable into PAPSS2 protein, or a 
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functional fragment thereof Thus, one benefit of the present invention is that the nucleic 

acid can be used in a genetic replacement therapy for humans to correct clinical disorders 

derived from defective endogenous expression of PAPSS activity. Such defective 

expression can result from, but need not result from, endogenous underexpression of 

functional PAPSS2 protein, in particular. 

Other preferred embodiments of the present nucleic acid construct also include, 
operatively linked as part of the transcriptional unit, a reporter gene sequence encoding a 
reporter protein for facilitating the detection or selection of cells containing the present 
nucleic acid construct and expressing PAPSS 2 or Papss2. Preferably, but not 
necessarily, the reporter gene encodes a fluorescent protein. Fluorescent proteins include 
green fluorescent protein (or enhanced green fluorescent protein), yellow fluorescent 
protein, blue fluorescent protein, a phycobiliprotein, such as phycoerythrin or 
phycocyanin, or any other protein which fluoresces under suitable wave-lengths of light. 
Another reporter gene suitable for some applications is a gene encoding a protein that can 
enzymatically lead to the emission of light from a substrate(s); for purposes of the present 
invention, such a protein is a "light-emitting protein." For example, a light-emitting 
protein includes proteins such as luciferase or apoaequorin. 

Alternatively, the DNA segment encodes a PAPSS2 or Papss2 gene-specific 
antisense RNA, such as an antisense RNA that specifically hybridizes to PAPSS2 mRNA 
or Papss2 mRNA, preventing translation therefrom. 

In another embodiment, the nucleic acid construct comprises a chimeric 
RNA/DNA, such as a chimeric RNA/DNA oligonucleotide. Such RNA/DNA chimers 
are useful, for example, in altering single nucleotide polymorphisms (SNPs) or other 
point mutations in an endogenous PAPSS2 gene sequence. For example, a SNP in 
PAPSS2 associated with encoding a truncated or otherwise dysfunctional PAPSS2 protein 
can be stably repaired using a chimeric RNA/DNA oligonucleotide of the present 
invention. 

In other embodiments, the nucleic acid construct is a probe or primer sequence. 

The inventive probe is a labeled polynucleotide that is useful in hybridization assays, such 

as Southern, Northern, or fluorescence in situ (FISH) hybidizations, for detecting the 

presence in a sample of polynucleotides having a gene-specific PAPSS2 or Papss2 

sequence. The probe is fully complementary to SEQ. ID. NOS.:l, 2, 9, or 10, or is 
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complementary to a gene-specific fragment of any of these. The suitable labels for a 

probe are typically radioisotopes or fluorogens incorporated into or covalently linked to 

the polynucleotide by methods known in the art, and are detectable by conventional 

radiation or fluorescence detecting methods. 
5 The primer of the present invention is an oligonucleotide sequence that is useful in 

amplifying a nucleic acid segment from a sample that defines a nucleotide sequence 

specific to PAPSS2 or Papss2 by known methods of PGR, including RT-PCR. The 

primer is preferably 15 to 30 nucleotides long, and most preferably 17 to 22 nucleotides 

long, but primers as short as 7 contiguous nucleotides may be useful for some gene- 
10 specific sequences. (E.g., Vincent, J., et al, Oligonucleonucleotides as short as 7-mers 

can be used for PCR amplification, DNA Cell Biol. 13(l):75-82 [1994]). 

Examples of useful oligonucleotide primer sequences for amplifying PAPSS2- 

specific nucleic acid segments include primer sequences comprising: 

5'-TGGACCAAGGATGACGATGT-3' (SEQ. ID. NO.: 3; forward primer); 
1 5 5'-CGGAAAGATGGCAACAATGG (SEQ. ID. NO. : 4; reverse primer); 

5'-CTGGTGCTGGAAAAACAACG-3' (SEQ. ID. NO.: 5; forward primer); 

5'-TGCGAATGGAGAAATAAAGCTG (SEQ. ID. NO.: 6; reverse primer); 

5'-GCCAGCCAGCATGTCGGGGAT-3'(SEQ. ID. NO.:ll; forward primer); 

5'-AGCTGAAACTCCTGAGCGTGTGCT-3' (SEQ. ID. NO : 12; forward primer); 
20 5'-GATGTGCCTCTAGACTGGCGG-3' (SEQ. ID. NO. : 13; forward primer); 

5'-GAGCACTTCAGAAAGAAACTCTGG-3'(SEQ. ID. NO : 14; reverse primer); 

5'-CATCCGCCAGTCTAGAGGCAC-3' (SEQ. ID. NO : 15; reverse primer); 

5 '-AGGTGTCAGAeGGTATTGGTC-3 ' (SEQ. ID. NO. : 1 6; reverse primer); 

5'-GTCACTCACTGTGGACAAATTGG-3'(SEQ. ID. N0.:17; reverse primer); 

25 5'-CACCTCAGCAATCCGGCGGAT-3' (SEQ. ID. NO. : 18; reverse primer);or 

5'-GCATGTCCAGACAGACACCAC-3' (SEQ. ID. NO.:28; reverse primer), but a 
complementary nucleotide sequence, or a gene-specific fragment of any of these is also 
useful; also useful are P^P55'2-specific primer sequences that overlap any of the afore- 
mentioned primer sequences at 5 or more contiguous nucleotide positions at their 5' or 3' 
30 end. 
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Useful primers for amplifying Pa/?^^2-specific sequences include primer 

sequences comprising: 

5'-TCTGGCACAAAGAGTTCGTG-3' [SEQ. ID. NO.:19; reverse primer]); 
5'-GCCAGTTTGTAAC CGAGTATTC-3' (SEQ. ID. NO.:20; forward primer); 
5'-GCAATTGGATACAGAGCAGC TA-3' (SEQ. ID. NO.:21; reverse primer); 
5'-GACAATGTCCGTCATGGCCTTA-3' (SEQ. ID. NO.:22; forward primer); 
5'-ATTCCCATTGTATTGCCCGTT-3' (SEQ. ID. NO.:23; forward primer); 
5'-AACGGGCAATACAATGGGAAT-3' (SEQ. ID. NO.:24; reverse primer); 
5'-GATAAAGCTGGTGATGCAAACC-3' (SEQ. ID. NO.:25; reverse primer);5'- 
CATGG GATGGCGTGAGATAC-3' (SEQ. ID. NO.:26; reverse primer); or 
5'-CATAAGCTTTGCTTT GGAAGAGT-3' (SEQ. ID. NO.. 27; forward primer); but a 
complementary nucleotide sequence, or a gene-specific fragment of any of these is also 
useful; also useful are Papss2 specific primer sequences that overlap any of the afore- 
mentioned primer sequences at 5 or more contiguous nucleotide positions at their 5' or 3' 
end. 

The present invention also relates to a pair of oligonucleotide primers (i.e., a 
primer set) comprising at least one forward and at least one reverse oligonucleotide 
primer, which together are capable of producing detectable nucleic acid amplification 
products from the 5' and 3' ends of a PAPSS2 or Papss2 nucleic acid template in a PCR- 
or RT-PCR-based nucleic acid amplification method. 

In some embodiments, the amplification products that the pair of primers can 
amplify include nucleic acid amplification products comprising (SEQ. ID. NO.:l) or 
(SEQ, ID. NO.:9); a nucleotide sequence complementary to either of those; or a PAPSS2 
gene-specific fi"agment of any of these. Examples of this embodiment of the inventive 
pair of oligonucleotide primers include pairs containing a forward primer having a 
nucleotide sequence comprising (SEQ. ID. NOS.: 3, 5, or any of 1 1-13), a complementary 
nucleotide sequence, a gene-specific fragment of either of these, or a nucleotide sequence 
overlapping at 5 or more contiguous nucleotide positions any sequence of those afore- 
mentioned at its 5* or 3' end; and containing a reverse primer having a nucleotide 
sequence comprising (SEQ. ID. NO.: 4, 6, 14-18, or 28), a complementary nucleotide 
sequence, a gene-specific fragment of either of these; or a nucleotide sequence 

overlapping at 5 or more contiguous nucleotide positions any sequence of those afore- 
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mentioned at its 5' or 3' end. For example, useful oligonucleotide primer pairs for 

amplifying ^ PAPSS2 gene sequence include (SEQ. ID. NO.:3 and 4) or (SEQ. ID. NO.:5 

and 6). 

In other embodiments, the amplification products that the pair of oligonucleotide 
primers can amplify include nucleic acid amplification products having (SEQ. ID. NO.:2) 
or (SEQ. ID. NO.: 10); a nucleotide sequence complementary to either of those; or a 
Papss2 gene-specific fragment of any of these. Examples of this embodiment of the 
inventive primer pair include pairs containing a forward primer having a nucleotide 
sequence comprising (SEQ. ID. NO.:20, 22, 23, or 27), a complementary nucleotide 
sequence, a gene-specific fragment of either of these, or a nucleotide sequence 
overlapping at 5 or more contiguous nucleotide positions any sequence of those afore- 
mentioned at its 5' or 3' end; and containing a reverse primer having a nucleotide 
sequence comprising (SEQ. ID. NOS,: 19, 21, or any of 24-26), a complementary 
nucleotide sequence, a gene-specific fragment of either of these; or a nucleotide sequence 
overlapping at 5 or more contiguous nucleotide positions any sequence of those afore- 
mentioned at its 5' or 3' end. 

Other embodiments of the inventive nucleic acid construct include those in which, 
cloned into an expression vector, is a DNA segment encoding a PAPSS2 or Papss2 
protein, operatively linked with any suitable promoter sequence, including a human 
PAPSS2 promoter, a promoter from another human or non-human PAPS synthetase gene 
(e.g., PAPSSl, Papssl, or Papss2 promoters), or a predetermined promoter sequence 
from any other human or non-human gene, as described above. Suitable vector systems 
include, but are not limited to, viral vectors, for example, adenoviruses, adeno-associated 
viruses, retroviruses such as human immune-deficiency virus, other lentiviruses, such as 
Moloney murine leukemia virus and the retrovirus vector derived from Moloney virus 
called vesicular- stomatitis- virus-glycoprotein (VSV-G)-Moloney murine leukemia virus, 
mumps virus, and transfecting fragments of any of these viruses, and other viral DNA 
segments that facilitate the uptake of the desired DNA segment by, and release into, the 
cytoplasm of cells and mixtures thereof All of the above viruses may require 
modification to render them non-pathogenic or less antigenic. Other known vector 
systems, however, are also suitable for containing the DNA segment. 
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The present invention also relates to a genetically modified vertebrate cell 

containing the nucleic acid construct of the present invention, regardless of the method by 

which the construct was introduced into the cell. Embodiments include cartilage-forming 

cells, such as chondrocytes, and liver cells, such as hepatocytes, as well as cells derived 

from other tissues such as, colon, lung, placenta, aortic or other vascular endothelium, 

bone marrow, or heart. 

If the nucleic acid construct includes a fluorescent reporter gene, as described 

above, the genetically modified cells of the present invention can be detected, isolated or 

selected from unmodified cells with the aid of, for example, a flow-activated cell sorter 

(FACS), set at the appropriate wavelength(s), or using conventional microscopic 

technology. 

In particular applications involving a genetically modified cell that expresses 
additional xenogeneic genes from any promoter, this expression may be linked to a 
reporter gene that encodes a different fluorescent or light-emitting protein from the 
reporter gene linked to PAPSS2 or Papss2 expression. Thus, multiple reporters 
fluorescing or emitting at different wavelengths can be chosen and cell selections based 
on the expression of multiple traits can be made. 

Gene delivery to the cell is by any suitable method including in vivo and vitro 
gene delivery methods. (E.g., D.T. Curiel et al, U.S. Patent Nos. 5,521,291 and 
5,547,932). Typically, gene delivery involves exposing a cell to a gene delivery mixture 
that includes preselected genetic material together with an appropriate vector, mixed, for 
example, with an effective amount of lipid transfecting agent (hpofection). The amount 
of each component of the mixture is chosen so that gene delivery to a specific species of 
cell is optimized. Such optimization requires no more than routine experimentation. The 
ratio of DNA to lipid is broad, preferably about 1:1, although other proportions may also 
be utilized depending on the type of lipid agent and the DNA utilized. This proportion is 
not crucial. Other well known gene delivery methods include electroporation or chemical 
methods, (E.g., M. Ostresh, No harriers to entry: transfection tools get biomolecules in 
the door. The Scientist 13(ll):21-23 [1999]). 

"Gene delivery agent", as used herein, means a composition of matter added to the 

genetic material for enhancing the uptake of exogenous DNA segment(s) into a 

vertebrate cell. The enhancement is measured relative to the uptake in the absence of the 
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gene delivery agent. Examples of gene delivery agents include 

adenovirus-transferrin-polylysine-DNA complexes. These complexes generally augment 

the uptake of DNA into the cell and reduce its breakdown during its passage through the 

cytoplasm to the nucleus of the cell. 

Other preferred gene delivery agents include Lipofectin®, DMRIE C, Cellfectin® 
or Lipofectamine (Life Technologies), LipoTAXI (Stratagene), Superfect or Effectene 
(Qiagen), Although these are not as efficient gene delivery agents as viral agents, they 
have the advantage that they facilitate stable integration of xenogeneic DNA sequence 
into the vertebrate genome, v^ithout size restrictions commonly associated with virus- 
derived gene delivery agents. A virus, or transfecting fragment thereof, can be used to 
facilitate the delivery of the genetic material into the cell. Examples of suitable viruses 
include adenoviruses, adeno-associated viruses, retroviruses such as human 
immune-deficiency virus, other lentiviruses, such as Moloney murine leukemia virus and 
the retrovirus vector derived from Moloney virus called vesicular-stomatitis-virus- 
glycoprotein (VSV-G)-Moloney murine leukemia virus, mumps virus, and transfecting 
fragments of any of these viruses, and other viral DNA segments that facilitate the uptake 
of the desired DNA segment by, and release into, the cytoplasm of cells and mixtures 
thereof All of the above viruses may require modification to render them non-pathogenic 
or less antigenic. Other known vector systems are also useful. 

The present invention also relates to genetically modified non-human vertebrates 
comprising the present genetically modified cells, for example, non-human primates, 
mice, rats, rabbits, gerbils, hamsters, canines, felines or other non-human mammals. 
Other vertebrates include birds such as chickens, turkeys, ducks, ostriches, emus, geese, 
guinea fowl, doves, quail, rare and ornamental birds, and the like. Broadly speaking, a 
genetically modified vertebrate is one that has had exogenous or foreign DNA 
permanently introduced into one or more of its cells, whether somatic or germ cells. 

The present invention also relates to an isolated human PAPS synthetase protein 
comprising a polypeptide having an amino acid sequence of (SEQ. TD. NO.:7), i.e., the 
following contiguous amino acid residues, or an antibody binding fragment of the protein 
at least 6 amino acids long (amino acid residue 475 is in boldface): 
MSGIKKQKTENQQKSTNWYQAHHVSRNKRGQVVGTRGGFRGCT 
VWLTGLSGAGKTTISFALEEYLVSPIAIPCYSLDGDNVRHGLNRNLGFSPGDREENIRR 
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lAEVAKLFADAGLVCITSFISPFAKDRENARKIHESAGLPFFEIFVDAPLNICESRDV 

KGLYKRARAGEIKGFTGIDSDYEKPETPERVLKTNLSTVSDCVHQVVELLQEQNIVPY 

TIIKDIHELFVPENKLDHVRAEAETLPSLSITKLDLQWVQVLSEGWATPLKGFMREKE 

YLQVMHFDTLLDDGVINMSIPIVLPVSAEDKTRLEGCSKFVLAHGGRRVAILRDAEFY 

EHRKEERCSRWGTTCTKHPHIKMVMESGDWLVGGDLQVLEKIRWNDGLDQYRLTPLE 

LKQKCKEMNADAVFAFQLRNPVHNGHALLMQDTCRRLLERGYKHPVLLLHPLGGWTK 

D 

DDVPLDWRMKQHAAVLEEGVLDPKSTIVAIFPSPMLYAGPTEVQWHCRSRMIAGANFY 
IVGRDPAGMPHPETKKDLYEPTHGGKVLSMAPGLTSVEIIPFRVAAYNKAKKAMDFYD 
P ARmEFDFISGTRMRKLAREGENPPDGFMAPKAWKVLTDYYRSEMDKN//(SEQ . ID . 
N0.:7) 

This polypeptide sequence is encoded for example by (SEQ. ID. NO.:9), but is also 
encoded by other degenerate coding sequences. 

In one embodiment, the PAPS S2 protein forms part of a PAPSS2 fusion protein 
along with a second predetermined polypeptide segment, which can have an amino acid 
sequence related or unrelated to (SEQ, ID. NO.:7). The PAPSS2 fusion protein includes 
a first PAPSS2 polypeptide segment comprising an amino acid sequence of (SEQ. ID. 
NO.: 7), an enzymatically active fragment thereof, or a gene-specific antibody binding 
fragment of (SEQ. ID. NO.:7) at least 6 amino acids long; and a second predetermined 
polypeptide segment. The fusion protein is produced by known recombinant methods, 
and the second predetermined polypeptide sequence can be any desired polypeptide, for 
example, a known antigenic sequence, such as a specific goat, rabbit, or HLA antigen to 
which immunoreagents of commercially available immunoassay systems are directed. 
Thus, in some applications, conventional immunoassay systems can be used "off the 
shelf to detect and quantify a PAPSS2 fiision protein by detecting the second 
polypeptide sequence. 

In other applications, for example in protein therapy for the treatment of 

osteoarthritic disorders related to deficient enzymatic sulfation, the second polypeptide of 

the fusion protein is a polypeptide that allows the hybrid molecule to enter cells of a 

target tissue, for example, cartilage, liver, or platelets, whether in vitro or in vivo. For 

example, a second polypeptide can be chosen that infiltrates the cells of tissues, such as 

the human immunodeficiency virus (HIV) TAT protein (Schwarze, S.R., et al. In vivo 

protein transduction: delivery of a biologically active protein into the mouse. Science 
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285:1569-72 [1999]), and once within the cells, the PAPSS2 polypeptide segment of the 

fusion protein becomes enzymatically active in the sulfation pathway within the cells. 

The present invention also relates to an isolated murine Papss2 protein comprising 

an amino acid sequence of (SEQ. ID. NO.:8), i.e., the following contiguous amino acid 

residues, or an antibody binding fragment of the protein at least 6 amino acids long: 

MSANFKMNHKRDQQKSTNVVYQAHHVSRNKRGQVVGTRGGFRGC 

TVWLTGLSGAGKTTISFALEEYLVSHAIPCYSLDGDNVRHGLNKNLGFSAGDREE 

NIR 

RIAEVAKLFADAGLVCITSFISPFAKDRENARKIHESAGLPFFEIFVDAPLNICESRD 
VKGLYKRARAGEIKGFTGIDSDYEKPETPECVLKTNLSSVSDCVQQWELLQEQN 
IVP 

HTTIKGIHELFVPENKVDQIRAEAETLPSLPITKLDLQWVQILSEGWATPLKGFMR 

EYLQTLHFDTLLDDGVINMSIPIVLPVSADDKARLEGCSKFALMYEGRRVALLQD 
PEF 

YEHRKEERCSRVWGTATAKHPHIKMVMESGDWLVGGDLQVLERIRWDDGLDQ 
YRLTPL 

ELKQKCKDMNADAVFAFQLRNPVHNGHALLMQDTRRRLLERGYKHPVLLLHPL 
GGWTK 

DDDVPLEWRMKQHAAVLEERVLDPKSTIVAIFPSPMLYAGPTEVQWHCRCRMIA 
GANF 

YIVGRDPAGMPHPETKKDLYEPTHGGKVLSMAPGLTSVEIIPFRVAAYNKIKKAM 
DFY 

DPARHEEFDFISGTRMRKLAREGEDPPDGFMAPKAWKVLTDYYRSEMDKTN//(S 
EQ. ID. NO : 8). 

This polypeptide sequence is encoded for example by (SEQ. ID. NO.: 10), but is also 

encoded by other degenerate coding sequences. 

In one embodiment, the Papss2 protein forms part of a Papss2 fusion protein along 

with a second predetermined polypeptide segment, which can have an amino acid 

sequence related or unrelated to (SEQ. ID. N0.:8). The Papss2 fusion protein includes a 

first Papss2 polypeptide segment comprising an amino acid sequence of (SEQ. ID. 

N0.:8) or a gene-specific antibody binding fragment thereof at least 6 amino acids long; 
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and a second predetermined polypeptide segment. As above, the fusion protein is 

produced by known recombinant methods, and the second predetermined polypeptide 

sequence can be any desired polypeptide, for example, a known antigenic sequence, such 

as a specific goat, rabbit, or HLA antigen to which immunoreagents of commercially 

available immunoassay systems are directed. Thus, in some applications, conventional 

immunoassay systems can be used "off the shelf to detect and quantify a Papss2 fusion 

protein by detecting the second polypeptide sequence. 

The present invention also relates to an isolated antibody or antibody fragment 
that selectively binds a PAPS synthetase protein having an amino acid sequence of 
(SEQ. ID. NO.:7) or (SEQ.ID. NO. 8), or selectively binds an antibody binding fragment 
of either of these at least 6 amino acids long. These are usefiil for detection and 
visualization of PAPSS2 or Papss2 proteins in various applications. The antibody or 
antibody fragment is raised against the PAPSS2 or Papss2 protein of the present 
invention. The antibody is raised in a mammal, from which it is isolated in serum, or is 
most preferably produced in a hybridoma cell line, by known methods, and is a 
monoclonal antibody. Antibody fragments of the present antibody are also useful; these 
include Fab', F(ab')2, or F(v) fragments that are produced by known methods, typically 
by pepsinization of whole antibody. 

The present invention also relates to a method of diagnosing 
spondyloepimetaphyseal dysplasia in a human subject. The method involves amplifying 
a nucleic acid segment from a sample. The sample is derived from a human subject 
having at least one symptom of spondyloepimetaphyseal dysplasia, and the sample is of a 
bodily substance containing human nucleic acid of the subject. The nucleic acid segment 
defines a sequence from human chromosomal region 10q23-24, between microsatellite 
markers D10S1143 and D10S2470, to produce amplification products; and analyzing the 
amplification products for the presence of homozygosity for a variant allele of a PAPSS2 
gene, the presence of homozygosity for the variant allele of the gene corroborating a 
diagnosis of spondyloepimetaphyseal dysplasia in the human subject. 

Any bodily substance containing human nucleic acids from the subject may be 

sampled, collected, and/or transported for the purpose of practicing the method. A most 

preferred and convenient substance for sampling is blood. However, a sample of hair 

root, urine, amniotic fluid, chorionic villus biopsy, skin biopsy, vascular or oral 
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epithelium, spinal fluid, or other biopsy sample of any tissue is also useful. The sample is 

derived from cultured human cells, cell-free extracts, or other specimens indirectly 

derived from a subject's body, as well as from substances taken directly from a subject's 

body. Samples may be stored before amplification and/or analysis by well known storage 

means that will preserve nucleic acids in an analyzable condition, such as quick freezing, 

or a controlled freezing regime, in the presence of a cryoprotectant, for example, dimethyl 

sulfoxide (DMSO), glycerol, or propanediol-sucrose. Samples may also be pooled before 

or after storage for purposes of amplifying their nucleic acids for analysis. 

For purposes of the present invention ^'amplifying a nucleic acid segment " from a 
tissue sample of a subject "to obtain amplification products" encompasses any 
conventional means of amassing sufficient nucleic acid material for analysis. This 
includes, but is not limited to, amplification by conventional polymerase chain reaction 
methods (e.g., PGR, including RT-PCR). Alternatively, the practitioner can amplify the 
human subject's nucleic acids by in vitro cell culture and harvest of the subject's cultured 
cells, or by multiple sampling from the subject's tissues in vivo and pooling of multiple 
tissue samples from a subject. Nucleic acids thus "amplified," if they comprise a 
PAPSS2 gene or a gene-specific part thereof, are "amplification products" for purposes of 
the present invention. 

In a most preferred embodiment of the present method, PGR is used for 
amplifying the subject's nucleic acids, particularly from a PAPSS2 gene. If PGR is used, 
any combination of oligonucleotide primers that will amplify nucleic acid sequences of a 
PAPSS2 gene, or parts thereof, can be employed. Most preferably, amplification of the 
subject's nucleic acids can be achieved using PAPSS2 -specific oligonucleotide primers 
and primer pairs of the present invention, as described above. For example, usefiil 
primers comprise a nucleotide sequence of (SEQ. ID. NOS.:3-6, 11-18, or 28) or a 
PAPSS2-spQcif\o fi^agment of any of these at least 15 nucleotides long. In addition a 
sequence complementary to any of (SEQ. ID. NOS.:3-6, 11-18, or 28) or the PAPSS2- 
specific fragment, or any PAP SS2 -specific nucleotide sequence at least 15 nucleotides 
long and overlapping at 5 or more contiguous nucleotide positions any of (SEQ. ID. 
NOS.:3-6, 11-18, or 28) at its 5' or 3' end, is a usefiil primer sequence. Examples of 
pairs of oligonucleotide primers useful for practicing the method include primer pairs 
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comprising (SEQ. ID. NOS.:3 and 4) or (SEQ. ID. NOS.:5 and 6), but any of the present 

A4P*SS'2- specific primer pairs are useful. 

In a preferred embodiment of the method of the present invention, nucleotide 

sequencing of PAPSS2 can be used to analyze the amplification products of the nucleic 

5 acids in a human subject's tissue sample to detect a variant allele in a PAPSS2 gene. A 

variant allele of PAPSS2 can be detected by any nucleotide sequencing means, for 

example conventional dideoxy sequencing or preferably by using a commercially 

available automated sequencer, then by comparing the subject's nucleotide sequences to 

other known human PAPSS2 sequences available in genomic sequence databases, such as 

10 GenBank. 

In a most preferred embodiment of the present method that employs nucleotide 
sequencing, sequencing of PAPSS2 may be accomplished by using fluorescence-based 
single strand conformation polymorphism analysis (SSCP), a routine and reliable means 
of identifying point mutations, small insertions or deletions. (J.S. Ellison, Fluorescence- 

15 based mutation detection. Single-strand conformation polymorphism analysis [F-SSCP], 
Mol Biotechnol 5 (J): 1 7-3 J [1996]; H. Iwahana et al. Multiple fluorescence-based 
PCR-SSCP analysis using internal fluorescent labeling of PCR products, Biotechniques 
21(3):510-14, 516-19 [1996]; R, Makino et al, F-SSCP: fluorescence-based polymerase 
chain reaction-single-strand conformation polymorphism [PCR-SSCP], PCR Methods 

20 Appl. 2(1): 10-13 [1992]). An automated system may be used, such as an Applied 
Biosystems DNA sequencer, equipped with GENESCAN 672, Genetyper, or another 
appropriate analytical software package. 

Optionally, high throughput analysis is achieved by PCR multiplexing techniques 
well known in the art. (E.g., Z. Lin et al. Multiplex genotype determination at a large 

25 number of gene loci, Proc. Natl. Acad. Sci. USA 93(6):2582-87 [1996]). 

In another preferred embodiment of the present method, nucleotide sequencing is 
unnecessary for analyzing the amplification products. For example, heteroduplex 
analysis on high resolution gel matrices can be employed by the skilled practitioner to 
detect even single nucleotide polymorphisms. (M.T. Hauser et al. Generation of co- 

30 dominant PCR-based markers by duplex analysis on high resolution gels. Plant. J. 

16(1): 1 17-25 [1998]). The PCR/OLA procedure can be used for analyzing amplification 

products to detect SNPs in PAPSS2, (B.R. Glick and J. J. Pasternak, Molecular 
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Biotechnology: Principles and Applications of Recombinant DNA, ASM Press, 

Washington, D.C., pp. 197-200 [1994]). Conformation-sensitive gel electrophoresis of 

amplification products can also be employed as a means of analysis by the skilled artisan 

in practicing the methods of the present invention. (A. Markoff et al.^ Comparison of 

conformation-sensitive gel electrophoresis and single strand conformation polymorphism 

analysis for detection of mutations in the BRCAl gene using optimized conformation 

analysis protocols, Eur. J. Genet. 6(2): 145-50 [1998]). 

A skilled practitioner will know that electrophoresis for analyzing amplification 
products is done rapidly and with high sensitivity by using any of various methods of 
conventional slab or capillary electrophoresis, with which the practitioner can optionally 
choose to employ any facilitating means of nucleic acid fragment detection, including, but 
not Umited to, the use of radionuclides, biotin, UV-absorbance or laser-induced 
fluorescence. (K. Keparnik et al. Fast detection of a (CA)J8 microsatellite repeat in the 
Ig E receptor gene by capillary electrophoresis with laser-induced fluorescence 
detection. Electrophoresis 19(2);249-55 [1998]; H. Inoue et al. Enhanced separation of 
DMA sequencing products by capillary electrophoresis using a stepwise gradient of 
electric field strength, J. Chromatogr. A. 802(1): 179-84 [1998]; NJ. Dovichi, DNA 
sequencing by capillary electrophoresis. Electrophoresis 18(12-13):2393-99 [1997]; H. 
Arakawa et al. Analysis of single-strand conformation polymorphisms by capillary 
electrophoresis with laser induced fluorescence detection, J. Pharm. Biomed. Anal. 15(9- 
10): 1537-44 [1997]; Y. Baba, Analysis of disease-causing genes and DNA-based drugs 
by capillary electrophoresis. Towards DNA diagnosis and gene therapy for human 
diseases^ J Chromatgr B. Biomed. Appl. 687(2):271-302 [1996]; K.C. Chan et al. High- 
speed electrophoretic separation of DNA fragments using a short capillary, J. 
Chromatogr B. Biomed. Sci. Appl. 695(1): 13-15 [1997]). 

Any of diverse fluorescent dyes are optionally used to label primers or 

amplification products for ease of analysis, including but not limited to, fluorescein, 

SYBR Green I, YO-PRO-1, thiazole orange. Hex (i.e., 6-carboxy-2',4',7',4,7- 

hexachlorofluoroscein), FAM (i.e., 6-carboxyfluorescein), or TET (i.e., 4,7,2',7'- 

tetrachloro-6-carboxyfluoroscein). (E.g., J. SkeidsvoU and P.M. Ueland, Analysis of 

double-stranded DNA by capillary electrophoresis with laser-induced fluorescence 

detection using the monomeric dye SYBR green I, Anal. Biochem. 231(20):359-65 [1995]; 
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H. Iwahana et al^ Multiple fluorescence-based PCR-SSCP analysis using internal 

fluorescent labeling of PCR products, Biotechniques 21(30:510-14, 516-19 [1996]). 

Analyzing the amplification products can also be done by means of restricting the 
amplification products with one or more restriction enzymes, for example Hindi, 
followed by separation of the resulting fragments and analysis of fragment length or 
differential fragment migration in denaturing high-performance liquid chromatography 
(DHPLC) or gel electrophoresis, as above, including restriction-capillary electrophoresis. 
For example, this may be achieved by techniques known in the art, such as PCR- 
restriction fragment-SSCP, which can detect single base substitutions, deletions or 
insertions. (M. Tawata et al, A mass screening device of genome by polymerase chain 
reaction-restriction fragment-single strand conformation polymorphism analysis. Genet, 
Anal. 12(3-4): 125'-27 [1996]; H.H. Lee et al. Mutational analysis by a combined 
application of the multiple restriction fragment-single strand conformation polymorphism 
and the direct linear amplification DNA sequencing protocols. Anal, Biochem. 
205(2);289-93 [1992]). 

Variant alleles of a PAPSS2 gene may be of any nucleotide sequence defining an 
open reading frame, or part thereof, that encodes PAPSS2 protein, or of any nucleotide 
sequence, or part thereof, defining a regulatory region of a PAPSS2 gene. A. PAPSS2 
regulatory region may include, but is not limited to, a promoter region, an enhancer, a 
termination sequence, or any other nucleotide sequence, or part thereof, that regulates the 
transcription of PAPSS2 protein. Any mutation or polymorphism in a PAPSS2 gene that 
results in a loss of PAPSS2 expression or a decrease in PAPSS2 functional activity 
relative to a normal activity range, or which could do so if it were not masked by the 
presence in vivo of a fully functional PAPSS2 allele or by another compensatory 
physiologic mechanism, are contemplated by the present invention. 

For example, one variant allele of PAPSS2, characteristic of SEMD Pakistani- 
type, defines a stop codon instead of a serine codon corresponding to amino acid residue 
475 of SEQ. ID. N0.:7, and results in synthesis of a truncated PAPSS2 protein 474 amino 
acids long. Homozygosity for this variant allele is associated with SEMD Pakistani-type 
in a human subject, and thus corroborates a diagnosis of SEMD Pakistani type. 

However, detecting heterozygosity for a variant PAPSS2 allele is also useful, for 

genetic counseling purposes. Consequently, the present invention also relates to a 
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method of identifying a human carrier of an heritable allele associated with 

spondyloepimetaphyseal dysplasia. As in the present method of diagnosing 

spondyloepimetaphyseal dysplasia, the method of identifying a carrier involves 

amplifying a nucleic acid segment from a sample derived from a human subject without 

symptoms of spondyloepimetaphyseal dysplasia, to produce amplification products. As 

described above, the sample is of a bodily substance containing human nucleic acid of the 

subject. The nucleic acid segment defines a sequence from human chromosomal region 

10q23-24, between micro satellite markers D10S1143 and D10S2470. The amplification 

products are analyzed for the presence of a variant allele of a PAPSS2 gene, which is 

accomplished as described above. The presence of the variant allele of the gene 

identifies the human subject as a carrier of an heritable allele associated with 

spondyloepimetaphyseal dysplasia. A benefit of the present method of identifying a 

human carrier of an heritable allele associated with SEMD, is that couples in which one 

or both of the spouses is identified as a carrier of a SEMD-associated allele, can make 

more informed reproductive decisions. 

The present invention also relates to a gene therapy method for treating a human 

subject having an osteoarthritic disorder that is caused or aggravated by deficient 

enzymatic sulfation activity. Examples include spondyloepimetaphyseal dysplasia, 

Stickler syndrome, spondyloepiphyseal dysplasia, and achondrogenesis; achondroplasia; 

chondrodysplasia; diastrophic dysplasia; the collagen oligomeric matrix protein (COMP) 

disorders such as pseudoachondroplasia and multiple epiphyseal dysplasia. In one 

embodiment, the method involves exposing, in vitro, one or more cells derived directly 

(e.g., biopsied) or indirectly (e.g., cultured) from a tissue of a human subject having at 

least one symptom of an osteoarthritic disorder caused or aggravated by deficient 

enzymatic sulfation activity to a gene delivery mixture comprising at least one gene 

delivery agent and a nucleic acid construct having a nucleic acid segment encoding a 

PAPS synthetase protein having an amino acid sequence of (SEQ, ID. N0.:7) or (SEQ. 

ID. NO.: 8); causing the nucleic acid construct to be taken up by, and released into the 

cell(s), such that the nucleic acid construct is incorporated into the genome of the cell(s), 

whereby under appropriate physiological conditions expression of fiinctional PAPS 

synthetase protein is enhanced in the cell(s), compared to unmodified cells of the same 

kind. The genetically modified cells are then implanted into the tissue of the human 
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subject, where PAPS synthetase activity is thereby enhanced relative to pre-implantation, 

and at least one symptom associated v^ith the osteoarthritic disorder in the subject is 

improved. 

For purposes of treating symptoms of aberrant skeletal growth, the human subject 
is preferably a juvenile, and most preferably a prepubescent juvenile. However, for 
purposes of treating symptoms of joint degeneration associated with osteoarthritic 
disorders, such as SEMD or chondrodysplasia, and for stabilizing the cartilage matrix, the 
human subject is any age. In one embodiment, the gene therapy method for treating an 
osteoarthritic disorder caused or aggravated by deficient enzymatic sulfation activity is 
directed to symptom(s) related to deficient sulfation activity that may arise in liver cells, 
e.g., hepatocytes, in platelets, or in vascular tissue, and these symptoms are treated in 
accordance with the method at any age, depending only on the practitioner's best 
estimation of the physiologic resilience of the individual patient in facing the stresses of 
the procedure. 

These stresses are typically minimal, as cells can be obtained by pertcutaneous or 
laparoscopic biopsy. However, cells are also obtained by any other suitable surgical 
means or from a pre-existing culture derived from a tissue of the subject or other viable 
sample of the subject's cells. Implantation of the genetically modified cells is 
accomplished by conventional percutaneous, laparoscopic, stereotactic, or other surgical 
methods. 

The method involves genetically modifying a cell(s) derived from a tissue of the 
subject. Preferably, but not necessarily, this is a cartilage-forming cell, such as a 
chondrocj^e from an epiphyseal region. However, other cell types can also be genetically 
modified in accordance with the method to improve osteoarthritic symptoms that may be 
associated with other tissues, for example hepatocytes, platelets, or vascular endothelium. 
The cells, for example, chondrocytes, hepatocytes, hematopoietic precursor cells, or 
vascular endothelial cells, are obtained from an appropriate tissue, such as the epiphyses, 
joints, liver, bone marrow, or vascular tissue. 

The cell(s) are genetically modified. This can be by any suitable gene delivery 

method as described above, preferably in vitro, for example by transfection, transduction, 

or electroporation, but in vivo methods of gene delivery are also contemplated. The 

cell(s) is exposed to a gene delivery mixture comprising at least one gene delivery agent, 
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as described above, and a nucleic acid construct, preferably DNA, containing a nucleic 

acid segment encoding a PAPS synthetase protein having an amino acid sequence of 

(SEQ- ID. NO.:7), for example (SEQ. ID. NO.:9X or containing a nucleic acid segment 

encoding a PAPS synthetase protein having an amino acid sequence of (SEQ. ID. NO.: 8), 

for example (SEQ. ID. NO.: 10), operatively linked, in a transcriptional unit, to a 

predetermined promoter sequence that operates in the cell. 

The nucleic acid construct that is used in the present gene therapy method 
optionally contains a reporter gene for convenient detection, isolation or selection of 
genetically modified cells expressing PAP synthetase protein from an exogenously 
supplied gene, as described above. 

The construct is then taken up by and released into the cell, and is incorporated 
into the genome of the cell. Under appropriate physiological conditions, transcription of 
the PAPSS2 or Papss2 encoding region occurs, followed by translation, and thus, 
expression of PAPSS2 or Papss2 protein is enhanced in the cell, compared to unmodified 
cells of the same kind. With the cell(s) implanted in the tissue of the subject, one or more 
symptoms of the osteoarthritic disorder in the subject are thereby improved due to 
enhanced enzymatic sulfation in the tissue of biochemicals that require sulfation for 
bioactivity or have increased bioactivity in humans in a sulfated form. 

Under some circumstances, the practitioner may deem it desirable to apply the 
present gene therapy method to a fetal subject that has been identified as being 
homozygous for a variant allele of a PAPSS2 gene by the present method of diagnosing 
spondyloepimetaphyseal dysplasia. In such instances, chondrocytes and stem cell 
precursors of chondrocytes can be obtained from fetal cartilage, can be genetically 
modified and re-implanted while the subject is in utero. 

Another embodiment of the gene therapy method of the present invention involves 

a gene modification strategy employing a chimeric RNA/DNA oligonucleotide construct, 

particularly useful in repairing SNPs associated with a SEMD phenotype, such as the 

allele encoding a stop codon instead of serine at position 475 of (SEQ. ID. N0.:7) (e.g., 

"TAA" instead of "TCA" at nt. +1423 through +1425 of SEQ. ID. NO.:l; see Figure 3), 

associated with SEMD Pakistani type. (E.g., Kren, B.T., et at. Gene repair using 

chimeric RNA/DNA oligonucleotides, Semin. Liver Dis, 19(1):93-104 [1999]; Alexeev, 

V. & Yoon, K., Stable and inheritable changes in genotype and phenotype of albino 
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melanocytes induced by an RNA-DNA oligonucleotide, Nat. Biotechnol 16(13): 1343-46 

[1998]; Cole- Strauss, A.., et al. Targeted gene repair directed by the chimeric KNA/DNA 

oligonucleotide in a mammalian cell free extract^ Nucleic Acids Res. 27(5); 1323-30 

[1999]; Xiang, Y., et al. Targeted gene conversion in a mammalian CDS 4^ -enriched cell 

population using a chimeric RNA/DNA oligonucleotide, J. Mol. Med. 75(1 1-12):829-35 

[1997]; Cole- Strauss, A.., et at. Correction of the mutation responsible for sickle cell 

anemia by an RNA-DNA oligonucleotide. Science 273(5280): 1386-89 [1996]; Yoon, K., 

et ah. Targeted gene correction of episomal DNA in mammalian cells mediated by a 

chimeric RNA.DNA oligonucleotide, Proc. Natl. Acad. Sci. USA 93(5):2071-76 [1996]). 

Briefly, this gene modification strategy exploits the recombinogenicity of RNA-DNA 

hybrids and features the use of chimeric RNA/DNA oligonucleotides with hairpin capped 

ends that avoid destabilization or destruction by cellular helicases or exonucleases; an 

RNA/DNA oligonucleotide is designed so that it perfectly aligns with a specific genomic 

target sequence except for a single mismatched basepair, which is recognized by 

endogenous DNA repair systems that can then convert the SNP to a desired normal 

sequence. 

In this embodiment, the cells are exposed to a nucleic acid construct that includes 
a chimeric RNA/DNA oligonucleotide. This chimeric RNA/DNA comprises a nucleic 
acid segment of ^PAPSS2 gene comprising a nucleotide sequence of (SEQ. ID. NO.:9), 
(SEQ. ID. NO.: 10), or a gene-specific fragment of either of these. The nucleic acid 
construct is taken up by, and released into, the cell, and the nucleotide sequence of the 
nucleic acid segment is incorporated into the genome of the cell, whereby expression of 
functional PAPS synthetase protein is enhanced in the cell. Under appropriate 
physiological conditions, transcription of the PAPSS2 or Papss2 encoding region occurs, 
followed by translation, and thus, expression of PAPS synthetase protein is enhanced in 
the cell, compared to unmodified cells of the same kind. The genetically modified cell(s) 
is then implanted into the tissue of the human subject, as described above. With the 
cell(s) implanted in the tissue of the subject, one or more symptoms of the osteoarthritic 
disorder in the subject are thereby improved due to enhanced enzymatic sulfation in the 
tissue of biochemicals that require sulfation for bioactivity or have increased bioactivity 
in humans in a sulfated form. 
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Thus, the gene therapy method provides a much needed way to improve or 

ehminate symptoms of aberrant skeletal grov^h or joint degeneration associated with 

osteoarthritic disorders such as SEMD. The method provides a way to improve 

symptoms resulting from deficient sulfation activity in other cell types, such as 

hepatocytes, platelets, or vascular epithelium, for example, abnormally long bleeding 

time. 

The present invention is also related to a protein therapy method for treating a 
human subject having an osteoarthritic disorder. The osteoarthritic disorder is one that is 
caused or aggravated by deficient enzymatic sulfation activity, as described above. The 
method employs the inventive PAPSS2 fiision protein. The method involves exposing a 
cell(s) of a tissue of a human subject having the osteoarthritic disorder to a fusion protein 
comprising a first PAPSS2 polypeptide segment that comprises an amino acid sequence 
of (SEQ. ID. NO.; 7), or an enzymatically active fragment thereof, and a second 
polypeptide segment capable of infiltrating the cell, whereby the fusion protein is taken 
up by the cell and the PAPSS2 polypeptide segment is enzymatically active within the 
cell. Exposing the cell(s) is preferably accomplished in vivo, for example by injecting or 
infusing the fusion protein into the subject. An example of a second polypeptide segment 
capable of infiltrating a diversity of cell types is an human immunodeficiency virus (HIV) 
TAT protein, (Schwarze, S.R., et al [1999]). But a tissue-specific second polypeptide 
can also be used. 

The skilled artisan is aware of suitable means of detecting an improvement in the 
subject's symptom(s) resulting fi-om treatment by the inventive genetic or protein therapy 
methods. For example, a useful indicator of improvement in symptoms of aberrant 
skeletal grov^h is a comparison of the subject's grov^h rate over a period of months or 
years after treatment using the method, with the rate of growth before treatment. It is also 
useful to compare the subject's growth over time after treatment with standard growth 
curves or tables for humans of the same age and sex. Improvement in joint function, for 
example reduction in joint pain or enhanced joint movement, or joint appearance on x- 
ray, are useful indicators. Another useful indicator is improvement in platelet ftinction, 
which is detected in normalized or shortened bleeding time for the subject. 

Alternatively, biochemical or molecular methods are employed to detect 

improvement. For example, assays showing increased sulfation activity in a tissue biopsy 
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sample(s) from the subject, such as in cartilage, Uver, or blood samples and/or extracts, 

are useful in determining the effectiveness of the method. Commonly, such assays are 

based on direct measurement of in vivo and/or in vitro PAPS synthetase enzymatic 

activity or detecting the assimilation of sulfate into APS (i.e., sulfate activation assay) or 

into other sulfated metabolites, for example measuring the degree of sulfation of aggrecan 

side chains. Radiolabeled (e.g., S) sulfate and chromatographic separations are 

common tools used in such assay methods. Detection of PAPSS2-spQcific mRNA 

transcripts in tissue samples, using the probes or primers of the present invention, can 

also be employed as an indirect measure of improvement. 

Alternatively, the nucleic acid construct that is used in the present gene therapy 
method can optionally contain in the transcriptional unit a reporter gene, as described 
above, for convenient detection of genetically modified cells expressing exogenous 
PAPSS2 in a tissue biopsy sample(s) from the subject. 

The present invention is also related to a genetic testing kit for diagnosing SEMD 

in a human subject. The kit is an assemblage of materials for facilitating amplifying 

nucleic acids from a human subject comprising PAPSS2 nucleotide sequences and/or 

analyzing PAPSS2-dQmed amplification products. A genetic testing kit of the present 

invention comprises at least one oligonucleotide primer of the present invention and 

preferably comprises a pair of oligonucleotide primers of the present invention, together 

with instructions for the practice of the methods of the present invention. A most 

preferred embodiment of the genetic testing kit of the present invention comprises an 

inventive pair of oligonucleotide primers, as described above, for example (SEQ. ID. 

NOS.:3 and 4) or (SEQ. ID. NOS,:5 and 6). A preferred embodiment of the genetic 

testing kit of the present invention can incorporate an array of oligonucleotide primers 

specific for SNPs in human PAPSS2, preassembled in a "DNA chip" (or "gene chip") 

configuration for facilitating the amplifying of nucleic acids and the analyzing of 

amplification products. (E.g., J.G.Hacia et al. Enhanced high density oligonucleotide 

array-based sequence analysis using modified nucleoside triphosphates^ Nucleic Acids 

Res. 26(2):4975-82 [1998]; R.W. Wallace, DNA on a chip: serving up the genome for 

diagnostics and research, Mol. Med. Today 3(9):384-89 [1997]; T. Pastinen et ah, 

Minisequencing: a specific tool for DNA analysis and diagnostics on oligonucleotide 

arrays. Genome Res. 7(6):606-14 [1997]; M.T. Cronin et al. Cystic fibrosis mutation 
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detection by hybridization to light-generated DNA probe arrays^ Hum. Mutat. 7(3):244- 
55 [1996]; A.C. Pease et ah. Light-generated oligonucleotide arrays for rapid DNA 
sequence analysis, Proc. Natl Acad. Sci. USA 91(ll):5022-26 [1994]; E.M. Southern et 
ah. Arrays of complementary oligonucleotides for analyzing the hybridisation behaviour 
of nucleic acids. Nucleic Acids Res. 22(8): 1368-73 [1994]). 

The present invention also relates to a kit for genetically modifying a vertebrate 
cell. The kit is a ready assemblage of materials or components for facilitating the genetic 
modification of a vertebrate cell. The kit includes a polynucleotide comprising a PAPSS2 
sequence having SEQ. ID. NOS.;l, or a Papss2 sequence having SEQ. ID. NO.:2, a 
sequence complementary to either of these, or a degenerate coding sequence thereof (i.e., 
encoding SEQ. ID. NOS.:7 or 8), or a gene-specific fragment of any of these, as described 
herein with respect to the nucleic acid construct of the present invention. Preferably the 
polynucleotide includes a transcriptional unit that contains the human PAPSS2- or murine 
Papss2-encoding nucleic acid segment, operatively linked to a functional promoter 
sequence, and/or a reporter gene for facilitating detection, isolation, or selection of 
genetically modified cells from unmodified cells. The kit optionally contains a suitable 
gene delivery agent. The kit is particularly useful in practicing the gene therapy method 
for treating a human subject having an osteoarthritic disorder that is caused or aggravated 
by deficient enzymatic sulfation activity. 

However, in several embodiments, the kit includes a DNA sequence encoding the 
PAPSS2 or Papss2 protein, or fragment thereof at least 6 amino acids long, in a sense or 
antisense orientation, as appropriate for a particular application. Alternatively, the kit 
includes ?iPAPSS2- or Pa/?552-specific chimeric RNA/DNA oligonucleotide, as described 
above. Some embodiments of the kit are configured for use in practicing the present 
method of treating a human subject having an osteoarthritic disorder, as described above. 

The present invention also relates to a kit for practicing the inventive method of 
protein therapy for treating a human subject having an osteoarthritic disorder that 
employs the inventive PAPSS2 fusion protein, as described above. The kit contains a 
fusion protein comprising a first PAPSS2 polypeptide segment that comprises an amino 
acid sequence of (SEQ. ID. NO.:7), or an enzymatically active fragment thereof, and a 
second polypeptide segment capable of infiltrating the cell, whereby the fusion protein is 
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taken up by the cell and the PAPSS2 polypeptide segment is enzymatically active within 
the cell. An example of a second polypeptide segment capable of infiltrating cell(s) is an 
human immunodeficiency virus (HIV) TAT protein, v^hich can infiltrate diverse cell 
types, but other polypeptide segments are also useful. 

The materials or components assembled in the present kits for diagnosing SEMD 
or identifying a carrier of a heritable marker associated with SEMD and kits for 
genetically modifying a vertebrate cell or for protein therapy can be provided to the 
practitioner stored in any convenient and suitable ways that preserve their operability and 
utility. For example the components can be in dissolved, dehydrated, or lyophilized 
form; they can be provided at room, refrigerated or frozen temperatures. The kits of the 
present invention include instructions for using the materials or components effectively 
for their intended purpose(s). 

It is contemplated that the polynucleotides, nucleic acid constructs, primers, 
primer pairs, and kits of the present invention can be used therapeutically to detect SEMD 
in patients with specific variant alleles of PAPSS2. These variant alleles can be targeted 
for gene therapy directed at repairing malfunctional PAPSS2 alleles or they may suggest 
specific chemotherapy directed to enhancing endogenous PAPS synthetase activity or 
may be directed to other metabolic targets in biochemical pathways mediated by PAPS 
synthetase activity. 

The foregoing descriptions of the isolated polynucleotides, nucleic acid 
constructs, genetically modified cells and vertebrates, methods, and kits of the present 
invention are illustrative and by no means exhaustive. The invention will now be 
described in greater detail by reference to the following non-limiting examples. 

EXAMPLES 

Example 1: Methods . 

Genotyping . Synthetic oligonucleotide primers were obtained for genotyping in the 
SEMD family with polymorphic microsatellite markers from the CHLC Human 
Screening SetAVeber version 8 (Research Genetics). Additional markers were derived 
from a physical map of chromosome 10 (Gray, I.C, et al. An integrated physical and 
genetic map spanning chromosome band 10q24, Genomics 43:85-88 [1997]) and 
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databases from the Whitehead Institute for Biomedical Research, the Cooperative Human 
Linkage Center, and Genethon. 

Similar methods were used for analysis of microsatellite markers in the 
brachymorphic mice, using DNA samples from backcross animals recombinant in the bm 
region. (Rusiniak, M.E., et ah. Molecular markers near the mouse brachymorphic (bm) 
gene, which affects connective tissues and bleeding time, Mamm. Genome 7:98-102 
[1996]). Markers were derived from the Whitehead Institute for Biomedical Research 
mouse genetic and physical mapping database (http://www-genome.wi.mit.edu/cgi- 
bin/mouse/index). 

Linkage Calculations . Two-point linkage analyses were conducted for eight markers in 
the region of D10S1143 (see Figures 1 and 2 and Table 1). LOD scores were calculated 
with the MLINK subroutine of the LINKAGE 5.1 package (Terwilliger, J.D. & Ott, J,, 
Handbook of human genetic linkage, Baltimore: John Hopkins University Press [1994]) 
using a Pentium PC. Because of the computational problems introduced by extensive 
inbreeding and multiple generations with no genotyping or phenotyping data, the SEMD 
pedigree was divided into three separate sub-pedigrees. Individuals in the three nuclear 
families from the most recent generation, including each of their ancestors back to the 
founding couple as well as all inbreeding loops, were analyzed separately. The 
genotypes of each individual were analyzed only once. 

SEMD was modeled as an autosomal recessive fully penetrant disease with an 
allele frequency of .0001. LOD scores were summed across the pedigrees. The LOD 
scores of -□ reflect recombinations within nuclear families for markers D10S2470 and 
DI OS J 753. Other markers did not recombine within the nuclear families, so that q = 0.0 
could not be ruled out for them, although parsimony indicates that recombination has 
probably occurred within the 

pedigree for each of these markers. The maximum LOD scores for markers D10S1I43 
and D10S2470 at q=.01 were consistent with the results of the haplotype analysis. 
Because marker allele frequencies in the Pakistani population are unknown, for each 
marker, the allele frequencies were set at 1/N, where N was the number of observed 
alleles in the pedigree, 
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Radiation Hybrid Mapping . DNA from the Genebridge 4 Radiation Hybrid (RH) panel 
was obtained from Research Genetics. For the PAPSS2 gene, a 236 bp PGR product was 
amplified using primers 5'-CTGGTGCTGGAAAAACAACG-3' (SEQ, ID. NO : 5; 
forward primer) and 5'-TGCGAATGGAGAA ATAAAGCTG-3' (SEQ. ID. NO.:6; 
reverse primer). RH data were submitted to the Whitehead server (http://www- 
genome.wi.mit.edu/cgi-bin/contig/rhmapper.pl) for placement on the RH map. 

Computer Methods . BLAST searches (Altschul, S.F.,e^ al. Gapped BLAST and P SI- 
BLAST: a new generation of protein search programs. Nucleic Acids Res. 25:3389-3402 
[1997]) and determinations of sequence homology were performed using the National 
Center for Biotechnology Information (http://www.ncbi, nlm.nih.gov/BLAST/) and 
Baylor College of Medicine (http://dot.imgen.bcm.tmc. edu:9331/seq-search/nucleic_acid- 
search.html) servers. The phenoytpic map of the mouse genome was accessed through 
the Mouse Genome Informatics server (http://www.informaticsjax.org), maintained by 
the Jackson Laboratory. 

cDNA library analysis . To isolate the human PAPSS2 cDNA clones, oligonucleotide 
primers B (SEQ. ID. NO.:12, below) and E (SEQ. ID. NO.:15, below) were designed 
from the EST sequences identified by BLAST searching to ampUfy an 826 bp cDNA 
fragment by RT-PCR from human fetal cartilage RNA. The fragment was hybridized to 
plaques from a directional human fetal cartilage cDNA library constructed by Stratagene 
in the lambda ZAP2 vector (D, Krakow and D. Cohn, unpublished). Alternatively, a 
forward primer (SEQ. ID. NO: 11, below) and a reverse primer 5'- 
GCATGTCCAGACAGACACCAC-3' (SEQ, ID. NO.:28) were used to produce, from 
fetal cartilage RNA, a cDNA sequence 1940 base pairs long, including the entire PAPSS2 
coding region and flanking regions, which nucleic acid segment was then used as a 
hybridization probe of the fetal cartilage cDNA library. 

Primers used for sequence analysis of the cDNA clones were: 

(A) 5'-GCCAGCCAGCATGTCGGGGAT-3'(SEQ. ID. NO.:ll; forward primer); 

(B) 5'-ACCTGAAACTCCTGAGCGTGTGCT-3' (SEQ. ID. NO.:12; forward primer); 

(C) 5'-GATGTGCCTCTAGACTGGCGG-3' (SEQ. ID. NO.:13; forward primer); 

(D) 5'-GAGCACTTCAGAAAGAAACTCTGG-3'(SEQ. ID. NO.:14; reverse primer); 
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(E) 5'-CATCCGCCAGTCTAGAGGCAC-3' (SEQ. ID. NO.:15; reverse primer); 

(F) 5'-AGGTGTCAGACGGTATTGGTC-3' (SEQ. ID. NO.:16; reverse primer); 

(G) 5'-GTCACTCACTGTGGACAAATTGG-3'(SEQ. ID. NO.:17; reverse primer); and 

(H) 5'-CACCTCAGCAATCCGGCGGAT-3' (SEQ. ID. NO.: 18; reverse primer). 

The sequence of 3' end of the coding region of the mouse Papss2 cDNA was 
identified by database searching. The 5' end of the cDNA was isolated from a mouse 
brain cDNA library, constructed from the C57BL/6 strain in the Uni-ZAP XR vector 
(Stratagene), by amplification from pooled library DNA using a gene specific primer 
derived from the sequence at locus D19Mitl3 (5'-TCTGGCACAAAGAGTTCGTG-3' 
[SEQ. ID. NO.: 19; reverse primer]) and a vector-specific primer. To amplify the entire 
coding region of Papssl, oligonucleotide primers from the 5' ([a] 5'- 
GCCAGTTTGTAAC CGAGTATTC-3' [SEQ. ID. NO.:20; forward primer]) and 3' ([b] 
5'-GCAATTGGATACAGAGCAGC TA-3' [SEQ. ID. NO.:21; reverse primer]) 
untranslated regions were used. Sequence was determined using the 5' and 3' 
untranslated region primers as well as primers: 

(c) 5'-GACAATGTCCGTCATGGCCTTA-3' (SEQ. ID. NO.:22; forward primer); 

(d) 5'-ATTCCCATTGTATTGCCCGTT-3' (SEQ. ID. NO.:23; forward primer); 

(e) 5'-AACGGGCAATACAATGGGAAT-3' (SEQ. ID. NO.:24; reverse primer); and 

(f) 5'-GATAAAGCTGGTGATGCAAACC-3' (SEQ. ID. NO.:25; reverse primer). 

Mutation analysis . To define the human mutation, RNA was purified from a 
lymphoblastoid cell line from an affected family member. RT-PCR using either the 
primers from the 5'- and 3'- untranslated regions (primers A and D, above) or internal 
primers were used to amplify cDNA fragments containing the entire coding region. 
Control fragments were amplified from human cartilage RNA. Direct sequence analysis 
of the PGR products using the primers listed above was used to identify the mutation. To 
confirm the mutation and examine segregation in the Pakistani family, we used HincH 
digestion of a 108 bp genomic DNA fragment amplified with primers 5'-TGGA 
CC AAGGATGACGATGT-3 ' (SEQ. ID. NO. :3 ; forward primer) and 
5'-CGGAAAGATGGCAACAATGG-3' (SEQ. ID. NO.:4; reverse primer). 
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To define the mouse mutation, RNA was purified fi"om fi^ozen spleen firom the 
brachymorphic mouse and a PWK control. (Rusiniak et ah [1996]). The Papssl coding 
region was amplified in two fi-agments by RT-PCR using primer pair (a) (SEQ. ID. 
NO.:20; forward primer) plus (g) 5'-CATGG GATGGCGTGAGAT AC-3 ' (SEQ. ID. 
NO.:26; reverse primer) and primer pair (b) (SEQ. ID. NO.:21; reverse primer) plus (h) 
5'-CATAAGCTTTGCTTT GGAAGAGT-3' (SEQ. ID. NO,:27; forward primer), and the 
sequence was determined using the primers (c)-(f) listed above. 

Example 2: Linkage of SEMD Pakistani type to markers on human chromosome 10 . 

A pooling strategy was used to carry out a genome-wide search for the 
chromosomal region containing the disease gene in a large inbred Pakistani SEMD family 
(Ahmad et al, 1998, incorporated by reference). A total of 381 markers were typed, 
ultimately achieving an average marker spacing of about 10 cM. Individual genotypes for 
all studied family members were determined for 83 markers that showed differences in 
allele distributions between the affected and unaffected pools. For all but three of these 
markers, D10S1432, DJ0S2 4 70, and D10S677, linkage to the SEMD phenotype could be 
excluded. Significant hnkage was initially obtained for the marker at locus D10S2470, 
yielding a maximum two-point lod score of 5,44 at a recombination fraction of 0.01. 

Saturation genotyping with additional markers from the region confirmed the 
initial linkage result. Two-point lod scores for all markers are shown in Table 1, with the 
highest lod score of 5.51 at a recombination fraction of 0.01, obtained for the marker at 
locus DIOSI 143, 

Table 1. Linkage Analysis of Chromosome 10 Markers and SEMD. 
LOD Score at q = 



Marker 


0.0 


0.001 


0.01 


0.05 


0.1 


0.2 


0.3 


0.4 


D I OS 1 689 


-0.20 


-0.12 


0.19 


0.45 


0.41 


0.20 


0.10 


0.02 


D10S1717 


2.85 


3.02 


3.52 


3.62 


3.26 


2.39 


1.50 


0.66 


Dl OS 1644 


4.28 


4.44 


4.88 


4.76 


4.12 


2.78 


1.63 


0.67 


D10S1687 


4.23 


4.41 


4.87 


4.77 


4.18 


2.85 


1.66 


0.67 


Dl OS 17 39 


2. 16 


2.43 


3.02 


3.08 


2.69 


1.81 


1.02 


0.34 


D10S1143 


4.89 


5.06 


5.51 


5.35 


4.66 


3.14 


1.79 


0.71 



42 

LAI 351812v2 



Patent 18810-81553 

D10S2470 -oo 4.63 5.44 5.31 4.60 3.04 1.67 0.67 
D10S1753 -oo -0.57 1.02 1.80 1.74 1.20 0.66 0.20 

A proximal boundary at locus Dl OS 1689 was established by the haplotypes of 
three individuals, VI-2, VI-4 and VIII-2. A distal hmit at locus D10S2470 was 
established by a recombination observed in VIII-15. Together, these recombinant events 
localized the disease gene within a maximum interval of 8.1 cM, flanked on the proximal 
side by the marker at locus Dl OS 1689 and on the distal side by the marker at locus 
D10S2470. The genotype of individual VI-7 provided clear evidence of recombination 
within the 8.1 cM interval. Compatible with the recessive inheritance pattem for the 
disease, he was homozygous for the marker at only one locus, D 1082470^ within the 
maximum interval. Together with the distal boundary of the larger interval established by 
the recombination in individual VIII-15j the data showed that the disease gene was 
contained within the 1.7 cM separating the markers at loci D10S1143 and D10S2470, 

Figure 1 shows a genetic map of the interval containing the SEMD disease gene 
on human chromosome 10. The darker bar identifies the 8.1 cM maximum interval 
between loci Dl OS 1689 and D10S2470, The 1.7 cM minimum interval is identified by 
the darkest bar, between loci D10S1143 and D10S2470, The locations of D10S1432 and 
D10S677 are not drawn to scale. 

Example 3 : A novel ATP sulfurvlase/kinase gene in mice and humans > 

The SEMD interval established by linkage corresponded to chromosomal region 
10q23- 

24, which is syntenic with the region of mouse chromosome 19 to which the recessive 
mouse skeletal mutant brachymorphic was localized. This regional homology implied 
that the two phenotypes could be due to mutations in an orthologous pair of genes. 
Additional recombination mapping using recombinant mice from a large backcross with 
the brachymorphic mouse showed that only 3 of 25 mice recombinant at the previously 
defined proximal boundary (Rusiniak, M.E,, et al. Molecular markers near the mouse 
brachymorphic (bm) gene, which affects connective tissues and bleeding time^ Mamm. 
Genome 7:98-102 [1996]), the markers at loci D19Mit39 and D19Mit57, were 
recombinant at locus D19Mitl35, The distal limit was not further narrowed by 
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recombination mapping, leaving a total estimated interval of about 0.7 cM, flanked by the 
markers at loci D19Mitl35 and D19Mit64^ containing the bm mutation. (Figure 2). 

As in prior genetic mapping studies (Rusiniak et al [1996]), the marker at locus 
D19Mitl3 was the only marker non-recombinant with the brachymorphic phenotype and, 
because of the large size of the cross, was therefore expected to be extremely close to the 
disease gene. A BLAST search using D19Mitl3 as the query sequence was carried out to 
test the hypothesis idea that the marker might be within an intron of the bm gene. 
Sequences on both sides of the D19Mitl3 CA repeat element were about 80% 
homologous to a human expressed sequence tag (EST), zc38a08.rl, which was distinct 
from but similar to the known PAPS synthetase gene, Papssl. Comparison of the EST 
sequence with D19MitI3 revealed that the CA repeat was contained within a 180 base 
pair intron bordered by consensus splice donor and splice acceptor sequences (data not 
shown). Further sequential BLAST searches using both the human PAPSSl and murine 
Papssl cDNA sequences, as well as EST sequences derived from each search, identified 
sets of overlapping ESTs from both species that could be used to assemble partial cDNA 
sequences. The ESTs were derived from a variety of tissue and cell types, including 
colon, lung, placenta, aortic endothelium, and fetal heart, suggesting a widely expressed 
gene. (Figure 4). 

To complete the human cDNA sequence, the nucleotide sequences of PAPSS2 
cDNA clones isolated from a human fetal cartilage cDNA library were determined- The 
reading frame was determined by comparison with the paralogous human PAPSSl and 
mouse Papssl cDNA sequences. At the 5' end, an in-frame ATG codon immediately 
preceded by a sequence that fit the consensus sequence for translation initiation (Kozak, 
M., An analysis of 5-noncodmg sequences from 699 vertebrate messenger RNAs, Nucleic 
Acids Res. 15:8125-48 [1987]) was assumed to represent the first translated codon. 
Conceptual translation of the 615 codon open reading frame predicted a molecular weight 
of 69,448 Da for the unmodified protein. The coding regions of the human PAPSS2 and 
PAPSSl cDNA sequences were 71% identical and the primary translation products were 
78% identical. Radiation hybrid mapping localized PAPSS2 at 519 cR on the 
chromosome 10 map, between loci AFM225YD12 and D10S583, and within the SEMD 
region defined by linkage. 
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Sequential BLAST searching and sequence analysis of PGR products from a 
mouse brain cDNA library was used to complete the coding sequence of the mouse 
Papssl cDNA. At the 5' end, there were two in- frame ATG codons, each preceded by a 
possible consensus sequence for translation initiation. An in-frame stop codon was 
located just upstream from the first ATG, further supporting the inference that one or both 
of the two ATG codons represent the site(s) of translation initiation. For purposes of 
comparison with the human orthologue, we have assumed that the first ATG is used. The 
open reading frame contains 617 codons and a molecular weight of 69,814 Da was 
calculated for the encoded protein. The coding regions of the human PAPSS2 and mouse 
Papssl genes were 86% identical at the cDNA level and 93% identical at the protein 
level. 

Example 4: A stop codon in the human PAPSS2 gene in the Pakistani SEMD family . 

The coding region of the human PAPSS2 cDNA was amplified by RT-PCR using 
RNA derived from a lymphoblastoid cell line from an affected member of the Pakistani 
SEMD family. The only sequence change relative to the control fragment, amplified 
from human fetal cartilage RNA, was a point mutation that predicted a stop codon in 
place of the serine codon for residue 475 of the protein. (Figure 3). The mutation thus 
predicted truncation of the protein from 615 to 474 amino acids. The mutation disrupted 
a Hindi restriction endonuclease cleavage site, and analysis of an amplified genomic 
DNA fragment containing the site of the mutation by HincW digestion demonstrated 
cosegregation of the mutation and the phenotype in the family; all affected individuals 
were homozygous for the mutation, and all obligate carriers (unaffected offspring of 
affected individuals and unaffected parents of affected individuals) were heterozygous for 
the mutation (data not shown). None of a panel of 43 DNA samples from individuals of 
Pakistani origin carried the mutation. 

The data presented here demonstrate that a mutation in a novel gene in the sulfate 
activation pathway results in a human osteochondrodysplasia phenotype. The PAPSS2 
mutation characterized in the Pakistani SEMD family introduced a nonsense mutation at 
the codon for residue 475 of the 615 amino acid protein. It is unknown whether, as has 
been observed for many other genes (e.g., Willing, M.C., et al. Premature chain 
termination is a unifying mechanism for COLlAl null alleles in osteogenesis imperfecta 
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type I cell strains^ Am. J. Hum. Genet. 59:799-809 [1996]), the nonsense mutation leads 
to destabilization of the transcript, or if a truncated protein, perhaps retaining partial 
activity, is synthesized. 

Even though the PAPSS2 gene is expressed in a wide variety of tissues (Figure 4), 
the mutation produces an exclusively skeletal phenotype. Thus, similar to the 
disproportionate effect on cartilage of mutations in the gene encoding the v^idely 
expressed sulfate transporter DTDST (Hastbacka, J., et al [1994]; Rossi, A., et al 
[1998]), the cartilage-specificity of phenotypic expression is likely the result of the high 
demand for sulfation of extracellular matrix molecules in cartilage. Differences in levels 
of expression of the ATPSKl and PAPSS2 transcripts among different tissues 
(unpublished data) suggest that this may also influence the tissue-specificity of the 
phenotype. In cartilage, both PAPSSl and PAPSS2 transcripts are expressed 
(unpublished data), but the relative abundances of the proteins in vivo, and their 
contributions to net PAPS synthesis, are unknow^n. The milder effect of a mutation in 
PAPSSl as compared with DrD^ST mutations implies that PAPSSl partially compensates 
for t\\e PAP SS2 defect by providing a physiologically significant level of PAPS synthesis 
in cartilage. 

Example 5: An Papss2 missense mutation in the brachymorphic mouse . 

Complete sequence analysis of the Papss2 coding region from the brachymorphic 
mouse and the PWK strain used in the backcross identified ten single nucleotide sequence 
differences between the strains [A at base 295 (A295), G342, G386, C649, G881, C1086, 
G1239, T1524, G1557, and T1815]. Of these, only two implied sequence differences at 
the protein level, predicting G79R and K109R substitutions, both within the APS kinase 
domain of the protein. As the normal mouse strain C57BL/6 was also homozygous for 
arginine at residue 109, the G79R was the likely pathogenic substitution. A glycine 
codon is present in the same position among APS kinase domains of eucaryotic Papss 
genes as well as among APS kinases of bacteria, suggesting it is essential for APS kinase 
activity. 

The conclusion that the G79R Papss2 substitution produces the brachymorphic 

phenotype is supported by the following evidence. First, the biochemical defect in PAPS 

synthetase activity measured in brachymorphic cartilage and liver tissue extracts, showing 
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markedly decreased APS kinase activity and a lesser effect on ATP sulfurylase activity, 
predicts a structural defect affecting the kinase domain of the protein. As the G79R 
substitution is within the kinase domain, these data are compatible with the biochemical 
evidence and the hypothesis that the abnormal enzyme does not properly channel APS to 
the kinase. (Lyle, S., et al^ Sulfate-activating enzymes in normal and brachymorphic 
mice: evidence for a channeling defect, Biochemistry 34:940-45 [1995]). Second, the 
decrease in Papss activity in liver and cartilage (Schwartz, N.B., et al [1978]; Sugahara, 
K. & Schwartz, N.B. [1979]; Sugahara, K. & Schwartz, N.B., Defect in 3'- 
phosphoadenosine 5^-phosphosulfate synthesis in brachymorphic mice. 11. tissue 
distribution of the defect. Arch. Biochem. Biophys. 214:602-09 [1982]) is correlated with 
the human tissues in which Papss2 is highly expressed (unpublished data). Third, the 
characterized mutation is the only structural mutation in the coding region that is unique 
to the brachymorphic mouse. Fourth, the mutation results in substitution for a highly 
conserved residue among other Papss proteins and APS kinase domains of species from 
vertebrates to bacteria, suggesting it is essential for kinase activity. Fifth, the phenotype 
of the brachymorphic mouse and the human SEMD family are substantially similar. Both 
are recessively inherited dwarfing conditions that affect the spine and long bones. (Lane 
& Dickie [1968]; Ahmad, M., et al [1998]). Both phenotypes are evident at birth, but 
disproportion increases postnatally. Radiographic studies in SEMD Pakistani type 
(Ahmad, M., et al [1998]) and histologic analysis of brachymorphic mice (Orkin et al 
[1977]) demonstrate that both phenotypes are characterized by a primarily epiphyseal 
dysplasia with only mild metaphyseal abnormalities. Although brachymorphism is 
produced by a missense mutation and SEMD by a nonsense mutation, the phenotypic 
similarity argues that an Papss2 null allele is likely to produce a mouse 
osteochondrodysplasia phenotype very similar or equivalent to brachymorphic. 

The foregoing examples being illustrative but not an exhaustive description of the 
embodiments of the present invention, the following claims are presented. 
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